STREPTOCOCCAL HEAT SHOCK PROTEINS
MEMBERS OF THE HSP70 FAMILY

TECHNICAL FIELD OF THE INVENTION

This invention relates to novel heat shock
proteins of Streptococcus pneumoniae, Streptococcus
pyogenes and Streptococcus agalactiae and immunologically
related polypeptides, which provide the basis for new
immunotherapeutic, prophylactic and diagnostic agents
useful in the treatment, prevention and diagnosis of
disease. More particularly, this invention relates to
heat shock proteins of S. pneumoniae, S. pyogenes and S.
agalactiae, members of the HSP70 family which have an
apparent molecular mass of 70-72 kilodaltons, to the
corresponding nucleotide and derived amino acid sequences,
to recombinant DNA methods for the production of
HSP70/HSP72 and immunologically related polypeptides, to
antibodies that bind to these HSPs, and to methods and
compositions for the diagnosis, prevention and treatment
of diseases caused by S. pneumoniae and related bacteria,
such as Streptococcus pyogenes and Streptococcus
agalactiae.

BACKGROUND OF THE INVENTION

S. pneumoniae is an important agent of disease
in humans, especially among infants, the elderly and
immunocompromised persons. It is a bacterium frequently
isolated from patients with invasive diseases such as
bacteraemia/septicaemia, pneumonia, and meningitis with
high morbidity and mortality throughout the world.
Although the advent of antimicrobial drugs has reduced the
overall mortality from pneumococcal diseases, the presence
of resistant pneumococcal organisms has become a major
problem in the world today. Effective pneumococcal
vaccines could have a major impact on the morbidity and
mortality associated with S. pneumoniae disease. Such
vaccines would also potentially be useful to prevent
otitis media in infants and young children.



It is clear that a number of pneumococcal
factors are potentially important in the pathogenesis of
disease [G.J. Boulnois, J. Gen. Microbiol., 138,
pp. 249-259 (1992); C.J. Lee et al., Crit. Rev. Microbiol., 18,
pp. 89-114 (1991)]. The capsule of the pneumococcus,
despite its lack of toxicity, is considered to be the sine
qua non of pneumococcal virulence. More than 80
pneumococcal capsular serotypes are identified on the
basis of antigenic differences. Antibodies are the
mechanism of protection and the importance of anticapsular
antibodies in host defenses against S. pneumoniae is well
established [R. Austrian, Am. J. Med., 67, pp. 547-549
(1979)]. Nevertheless, the currently available
pneumococcal vaccine, comprising 23 capsular
polysaccharides that most frequently caused disease, has
significant shortcomings such as the poor immunogenicity
of capsular polysaccharides, the diversity of the
serotypes and the differences in the distribution of
serotypes over time, geographic areas and age groups. In
particular, the failure of existing vaccines to protect
young children against most serotypes has spurred
evaluation of other S. pneumoniae components. Increasing
evidence indicates that certain pneumococcal proteins may
play an active role both in terms of protection and
pathogenicity [J.C. Paton, Ann. Rev. Microbiol., 47,
pp. 89-115 (1993)]. So far, however, only a few S.
pneumoniae proteins have been studied. This might result
from the lack of protein-specific antibodies, which makes
it difficult to study the role of protein antigens in
protection and pathogenicity. It is believed that the
pneumococcal protein antigens are not very immunogenic and
that most antibody responses are to the phosphocholine and
the capsular polysaccharides [L.S. McDaniel et al., J.
Exp. Med., 160, pp. 386-397 (1984); R.M. Krause, Adv.
Immunol., 12, pp. 1-56 (1970); D.G. Braun et al., J. Exp.
Med., 129, pp. 809-830 (1969)]. In a study using X-linked
immunodeficient mice, which respond poorly to carbohydrate
antigens and to phosphocholine, but make relatively normal
responses to protein antigens, the frequency for obtaining
monoclonal antibodies reactive with pneumococcal protein
antigens was less than 10%, thus suggesting that S.
pneumoniae proteins are poor immunogens [McDaniel et al.,
supra].

Streptococcus agalactiae, also called Group B
Streptococcus (GBS), is the most common cause of sepsis
(blood infection) and meningitis in newborns. GBS is also
a frequent cause of newborn pneumonia. Approximately
8,000 babies in the United States get GBS disease each
year; 5%-15% of these babies die. Babies that survive,
particularly those who have meningitis, may have long-term
problems, such as hearing or vision loss or learning
disabilities. In pregnant women, GBS can cause urinary
tract infections, womb infections (amnionitis,
endometritis), and stillbirth. Among women who are not
pregnant and men, the most common diseases caused by GBS
are blood infections, skin or soft tissue infections, and
pneumonia. Approximately 20% of men and nonpregnant women
with GBS disease die of the disease. GBS infections in
both newborns and adults are usually treated with
antibiotics (e.g., penicillin or ampicillin) given
intravenously. Most GBS disease in newborns can be
prevented by giving certain pregnant women antibiotics
intravenously during labor. Vaccines to prevent GBS
disease are being developed. In the future, it is
expected that women who will be vaccinated will make
antibodies that cross the placenta and protect the baby
during birth and early infancy.

Since the 1980s, Streptococcus pyogenes, also
called Group A Streptococcus (GAS), has been reemerging as a
cause of severe diseases, which may be due to an increase
in virulence of the organism. GAS causes pharyngitis,
commonly called "strep throat", and skin infections
(impetigo, erysipelas/cellulitis). "Strep throat" and
impetigo can lead to glomerulonephritis (kidney damage).
Approximately 3% of "strep throat" infections result in
rheumatic fever (migrating arthritis) whose complications
include chorea (neurological symptoms) and, in 50% of the
cases, rheumatic heart disease (heart valve damage) with
endocarditis as a possible long term consequence. It is
important to treat impetigo and "strep throat" with
antibiotics to prevent the development of complications.
Infection with toxin-producing strains can result in
scarlet fever (diffuse rash and fever) or in the extremely
severe streptococcal toxic shock syndrome (TSS; GAS have
been termed 'flesh eating bacteria'), which is
characterized by the rapid development of shock and
multiple organ system failure. TSS has a 30 to 70%
fatality rate in spite of aggressive treatment involving
removal of the focus of bacterial infection and
antibiotic therapy. The incidence of TSS is 10 to 20
cases per 100,000. No vaccine against GAS is presently
available.

Heat shock or stress proteins ("HSPs") are among
the most highly conserved and abundant proteins found in
nature [F.C. Neidhardt et al., Ann. Rev. Genet., 18,
pp. 295-329 (1984); S. Lindquist, Ann. Rev. Biochem., 55,
pp. 1151-1191 (1986)]. They are produced by all cells in
response to various physiological and nonphysiological
stimuli. The heat shock response, in which a sudden
increase in temperature induces the synthesis of HSPs, is
the best studied of the stress responses. Other
environmental conditions such as low pH, iron deficiency
and hydrogen peroxide can also induce HSPs. The HSPs have
been defined by their size, and members of the hsp90, hsp70
and hsp60 families are among the major HSPs found in all
prokaryotes and eukaryotes. These proteins fulfill a
variety of chaperone functions by aiding protein folding
and assembly and assisting translocation across membranes
[C. Georgopoulos and W.J. Welch, Ann. Rev. Cell Biol., 9,
pp. 601-634 (1993); D. Ang et al., J. Biol. Chem., 266,
pp. 24233-24236 (1991)]. As molecular chaperones and
possibly via other mechanisms, HSPs are likely involved in
protecting cells from the deleterious effects of stress.
The fact that several virulence factors are regulated by
environmental conditions suggests a role for HSPs in
microbial pathogenicity [J.J. Mekalanos, J. Bacteriol.,
174, pp. 1-7 (1992); P.J. Murray and R.A. Young, J.
Bacteriol., 174, pp. 4193-4196 (1992)]. In that respect,
recent studies on Salmonella species suggest that the
stress response might be critically linked to the ability
of intracellular pathogens to initiate and sustain an
infection [N.A. Buchmeir and F. Heffron, Science, 248,
pp. 730-732 (1990); K.Z. Abshire and F.C. Neidhardt,
J. Bacteriol., 175, pp. 3734-3743 (1993); B.B. Finlay
et al., Science, 243, pp. 940-943 (1989)]. Others have
demonstrated that listeriolysin, an essential virulence
factor in L. monocytogenes, is induced under heat shock
conditions [Z. Sokolovic and W. Goebel, Infect. Immun.,
57, pp. 295-298 (1989)].

Evidence is now accumulating that HSPs are major
antigens of many pathogens. Members of the hsp60 family,
also called GroEL-related proteins for their similarity to
the E. coli GroEL protein, are major antigens of a variety
of bacterial pathogens including Mycobacterium leprae and
Mycobacterium tuberculosis [D. Young et al., Proc. Natl.
Acad. Sci. USA, 85, pp. 4267-4270 (1988)], Legionella
pneumophila [B.B. Plikaytis et al., J. Clin. Microbiol.,
25, pp. 2080-2084 (1987)], Borrelia burgdorferi [B.J. Luft
et al., J. Immunol., 146, pp. 2776-2782 (1991)], and
Chlamydia trachomatis [E.A. Wagar et al., J. Infect. Dis.,
162, pp. 922-927 (1990)]. This antigen is a homologue of
the ubiquitous "common antigen", and is believed to be
present in every bacterium [J.E. Thole et al., Microb.
Pathogen., 4, pp. 71-83 (1988)]. Antibodies to the members
of the hsp70 family, or DnaK-related proteins, have also
been described for several bacterial and parasitic
infections [Young et al., supra; Luft et al., supra; D.M.
Engman et al., J. Immunol., 144, pp. 3987-3991 (1990);
N.M. Rothstein et al., Molec. Biochem. Parasitol., 33,
pp. 229-235 (1989); V. Nussenzweig and R.S. Nussenzweig,
Adv. Immunol., 45, pp. 283-334 (1989)]. HSPs can elicit
strong B- and T-cell responses and it was shown that 20%
of the CD4+ T-lymphocytes from mice inoculated with M.
tuberculosis were reactive to the hsp60 protein alone
[S.H.E. Kaufman et al., Eur. J. Immunol., 17, pp. 351-357
(1987)]. Similarly, 7 out of a collection of 24
monoclonal antibodies to M. leprae proteins recognized
determinants on hsp60 [H.D. Engers et al., Infect. Immun.,
48, pp. 603-605 (1985)]. It seems that the immune
response to stress proteins might play an important role
in protection against infection. Consistent with that is
the demonstration that antibodies and T cells reactive
with microbial HSPs can exhibit neutralizing and
protective activities [A. Noll et al., Infect. Immun., 62,
pp. 2784-2791 (1994); and S.L. Danilition et al., Infect.
Immun., 58, pp. 189-196 (1990)]. The immunological
properties of stress proteins make them attractive as
vaccine components and several HSPs are presently being
considered for preventing microbial infection and treating
cancer. So far, however, studies have focused on
intracellular pathogens such as Mycobacteria, Salmonella,
Chlamydia and several parasites. Information concerning
the heat shock protein antigens in extracellular
gram-positive bacteria is far less documented. In S.
pneumoniae, S. pyogenes and S. agalactiae, neither the
heat shock proteins nor their gene structures have been
identified.



DISCLOSURE OF THE INVENTION

The present invention addresses the problems
referred to above by providing novel heat shock proteins
from S. pneumoniae, S. pyogenes and S. agalactiae, and
immunologically related polypeptides. Also provided are
DNA sequences that code for the foregoing polypeptides,
vectors containing the polypeptides, unicellular hosts
transformed with those vectors, and a process for making
substantially pure, recombinant polypeptides. Also
provided are antibodies specific to the foregoing
polypeptides. The polypeptides, DNA sequences and
antibodies of this invention provide the basis for novel
methods and pharmaceutical compositions for the detection,
prevention and treatment of disease. Particularly, this
invention provides a novel vaccine based on fragments of
these polypeptides that are specific to streptococcal
strains.

The novel heat shock proteins are the
approximately 72 kDa heat shock protein of Streptococcus
pneumoniae ("HSP72") (SEQ ID NO: 5), the approximately 70
kDa heat shock protein of Streptococcus pyogenes ("HSP70")
(SEQ ID NO: 20) and the approximately 70 kDa heat shock
protein of Streptococcus agalactiae ("HSP70") (SEQ ID
NO: 22), including analogues, homologues, and derivatives
thereof, and fragments of the foregoing polypeptides
containing at least one immunogenic epitope. Preferred
fragments of HSP70/72 include the C-terminal portion of
the HSP70/72 polypeptides. More particularly, they include
the C-terminal 169-residue fragment ("C-169") (residues
439-607, SEQ ID NO: 5), the C-terminal 151-residue fragment
("C-151") (residues 457-607, SEQ ID NO: 5), and smaller
fragments consisting of peptide epitopes within the C-169
region. Particularly preferred fragments within the C-169
region of HSP72 include the peptide sequences
GFDAERDAAQAALDD (residues 527-541 of SEQ ID NO: 5) and
AEGAQATGNAGDDW (residues 586-600 of SEQ ID NO: 5), which
are exclusive to HSP72 of Streptococcus pneumoniae. Even
more preferred are fragments that elicit an immune
reaction against S. pneumoniae, S. pyogenes and S.
agalactiae but do not provoke auto-immune reaction in a
human host. Such fragments may be selected from the
following peptides: CS870, CS873, CS874, CS875, CS876,
CS877, CS878, CS879, CS880, CS882, MAP1, MAP2, MAP3 and
MAP4 (see TABLE 5, supra).

Preferred antibodies of this invention are the
F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 monoclonal
antibodies ("MAbs"), which are specific to HSP72.

More preferred antibodies are the F2-Pn3.2 and
F2-Pn3.4 monoclonal antibodies that are specific to both
HSP70 and HSP72. Even more preferred are the F1-Pn3.1
antibodies that are specific for Streptococcus pneumoniae.

The preferred polypeptides and antibodies of
this invention provide the basis for novel methods and
pharmaceutical compositions for the detection, prevention
and treatment of pneumococcal diseases.



BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a fluorogram, which shows the
effect of heat shock on S. pneumoniae protein synthesis.
The cell extracts in panel A are S. pneumoniae type 6
strain 64. The cell extracts in panel B are S. pneumoniae
type 4 strain 53. The cell extracts in the odd numbered
lanes were incubated at 37°C. The cell extracts in the
even numbered lanes were incubated at 45°C for 5 minutes.
The cell extracts were then labeled with [35S]methionine
for 10 minutes (lanes 1, 2 and 7, 8), 30 minutes (lanes 3,
4 and 9, 10), or 60 minutes (lanes 5, 6). Molecular mass
markers in kilodaltons are shown to the left. The
positions of HSP80, HSP72 and HSP62 are shown by arrows at
the right-hand side of each panel.

FIG. 2 is a graphical depiction of a comparison
of the electrophoretic profiles of [35S]methionine-labeled
proteins in S. pneumoniae in the presence ( ) or
absence ( ) of exposure to heat shock. Densitometric
tracings were determined by measuring the relative optical
density (Y axis) vs. the mobility of labeled protein bands
(X axis). The densitometric scans of the SDS-PAGE of FIG.
1, lanes 1 and 2, are shown.

FIG. 3 depicts a fluorogram, which shows the
S. pneumoniae protein antigens immunoprecipitated by sera
from mice immunized with detergent-soluble S. pneumoniae
protein extract. [35S]methionine-labeled proteins from
S. pneumoniae grown at 37°C and incubated at 37°C (lanes 3,
5, 7 and 9) or heat-shocked at 45°C (lanes 4, 6, 8 and 10)
were immunoprecipitated with sera from mouse 1 (lanes 3 to
6) or mouse 2 (lanes 7 to 10) and then analyzed by
SDS-PAGE and fluorography. The sera were tested after the
first (lanes 3, 4 and 7, 8) and after the second (lanes 5, 6
and 9, 10) immunization. Cell lysates from [35S]methionine-labeled
non heat-shocked and heat-shocked S. pneumoniae
are shown in lanes 1 and 2, respectively. The position of
HSPs is indicated by the arrows at the left of the
fluorogram.

FIG. 4 depicts a fluorogram, which shows the
S. pneumoniae protein antigens immunoprecipitated by sera
from mice immunized with heat-killed S. pneumoniae
bacteria. [35S]methionine-labeled proteins from
S. pneumoniae grown at 37°C and incubated at 37°C (lanes 3,
5 and 7) or heat-shocked at 45°C (lanes 4, 6 and 8) were
immunoprecipitated with sera from mouse 1 (lanes 3, 4),
mouse 2 (lanes 5, 6) or mouse 3 (lanes 7, 8) and then
analyzed by SDS-PAGE and fluorography. Sera were tested
after the second immunization only. Cell lysates from
[35S]methionine-labeled non heat-shocked and heat-shocked
S. pneumoniae are shown in lanes 1 and 2, respectively.
The position of HSPs is indicated by the arrows at the
left of the fluorogram.

FIG. 5 depicts a photograph, which shows the
S. pneumoniae antigens detected by Western blot analysis.
Whole cell extracts were probed with sera from 15 mice
(lanes 1-15) immunized with heat-killed S. pneumoniae
bacteria. Lane 16 shows the HSP72 protein detected by MAb
F1-Pn3.1. In panel A, the sera were tested after the
second immunization. In panel B, the reactivity of 4 out
of 15 sera tested after the first immunization is shown.
The positions of 53.5 kDa- and 47 kDa-protein bands are
indicated by the bars at the left. The position of HSP72
is shown by the arrows at the right of each panel.

FIG. 6 depicts a fluorogram showing the
specificity of MAb F1-Pn3.1 for HSP72. [35S]methionine-labeled
proteins of S. pneumoniae in the absence (lanes 1,
3 and 5) or presence (lanes 2, 4 and 6) of exposure to
heat shock were immunoprecipitated with IgG2a-control MAb
(lanes 3, 4) or F1-Pn3.1 (lanes 5, 6) and then analyzed by
SDS-PAGE and fluorography. Cell lysates from
[35S]methionine-labeled non heat-shocked and heat-shocked
S. pneumoniae are shown in lanes 1 and 2, respectively.
The position of HSPs (all three) is shown by the arrows at
the left of the fluorogram.

FIG. 7, panel A, depicts an immunoblot, which
shows the reaction of heat-shocked and non heat-shocked
[35S]methionine-labeled S. pneumoniae cell extracts with
MAb F1-Pn3.1. Lane 1 contains heat-shocked cell lysates
(45°C). Lane 2 contains non heat-shocked cell lysates
(37°C). Panel B depicts a fluorogram of the immunoblot
shown in panel A.

FIG. 8 depicts a Western blot, which shows the
subcellular localization of S. pneumoniae HSP72. Samples
containing 15 µg protein of membrane fraction (lane 1) and
cytoplasmic fraction (lane 2) of S. pneumoniae were
electrophoresed on SDS-PAGE, transferred to nitrocellulose
and probed with MAb F1-Pn3.1.

FIG. 9 is a photograph of an immunoblot showing
the reactivity of recombinant fusion proteins containing
the C-169 region of S. pneumoniae HSP72 with MAb F1-Pn3.1.
Lane 1 contains whole cell extracts from S. pneumoniae
strain 64 probed with HSP72-specific MAb F1-Pn3.1.
Lanes 2 and 3 contain phage lysates from E. coli infected
with λJBD17 cultured in the presence (+) or absence (-) of
IPTG and probed with HSP72-specific MAb F1-Pn3.1. Lanes 4
and 5 contain phage lysates from E. coli infected with
λJBD7 cultured in the presence (+) or absence (-) of IPTG
and probed with HSP72-specific MAb F1-Pn3.1. Molecular
mass markers are shown to the left. The positions of the
74 kDa- and 160 kDa-reactive proteins are shown on the left
and on the right, respectively.

FIG. 10 is a schematic representation of the
restriction map of the HSP72 (DnaK) and Fuc loci and
inserts of recombinant clones. The relationships between
DNA fragments are shown with respect to each other.
FIGS. 10A and 10C illustrate the restriction map of the
HSP72 (DnaK) and Fuc loci, respectively. FIG. 10B
illustrates the inserts of the various phages and plasmids
described in Example 3. H (HindIII); E (EcoRI); V (EcoRV);
P (PstI); and X (XhoI) indicate positions of restriction
endonuclease sites. DNA fragments on the HSP72/DnaK locus
(■); the Fuc locus (///); and fragments used as probes in
the Southern blot analyses (!:| ) are indicated.

FIG. 11 depicts the SDS-PAGE and Western blot
analyses of the recombinant 74 kDa protein. Whole cell
extracts from E. coli transformed with plasmids pJBD179
(lane 1), pJBDf51 (lanes 2 and 3) and pJBDf62 (lanes 4 and
5) and cultured in the presence (+) or absence (-) of IPTG
were subjected to 10% polyacrylamide gel electrophoresis.
The proteins were then visualized by Coomassie Blue
staining (A) or Western blotting (B) using HSP-specific
MAb F1-Pn3.1. Molecular mass markers in kilodaltons are
shown to the left. The arrow at the left-hand side of
each panel marks the 74 kDa protein marker.

FIG. 12 depicts the detection of native and
recombinant HSP72 antigens by Western blot analysis.
Whole cell lysates from E. coli transformed with plasmids
pJBDk51 (lanes 1 and 3) and pJBD291 (lane 2) and cell
lysates from S. pneumoniae strain 64 (lane 4) were
subjected to 10% polyacrylamide gel electrophoresis and
were electrotransferred to nitrocellulose. The immunoblot
was probed with HSP72-specific MAb F1-Pn3.1.

FIGS. 13A-13D depict a comparison of the
predicted amino acid sequence of the S. pneumoniae HSP72
open reading frame (HSP72 SPNEU) with those previously
reported for the following HSP70/DnaK proteins: ECOLI,
Escherichia coli; BORBU, Borrelia burgdorferi; BRUOV,
Brucella ovis; CHLPN, Chlamydia pneumoniae; BACME, Bacillus
megaterium; BACSU, Bacillus subtilis; STAAU,
Staphylococcus aureus; LACLA, Lactococcus lactis; and
MYCTU, Mycobacterium tuberculosis. Only mismatched amino
acids are indicated. Identical and conserved amino acids
are boxed and shadowed, respectively.

FIG. 14 depicts a photograph of an SDS-PAGE,
which shows the recombinant S. pneumoniae HSP72 purified
by affinity chromatography. Supernatant fractions from
E. coli (pJBDk51) lysates (lane 2) and 20 µg of
immunoaffinity-purified HSP72rec (lane 3) were subjected to
10% polyacrylamide gel electrophoresis. The proteins were
then visualized by Coomassie Blue staining. Lane 1 shows
the migration of molecular mass markers (106 kDa, 80 kDa,
49.5 kDa, 32.5 kDa, 27.5 kDa and 18.5 kDa).

FIG. 15 depicts a photograph of SDS-PAGE, which
shows the recombinant S. pneumoniae C-169 fragment
purified by solubilization of inclusion bodies. Various
amounts of purified C-169 protein (lane 1, 5 µg; lane 2,
2.5 µg; and lane 3, 1 µg) and whole cell lysates from
E. coli transformed with plasmids pDELTA1 (lane 4) and
pJBDΔ1 (lane 5) were subjected to 10% polyacrylamide gel
electrophoresis. The proteins were then visualized by
Coomassie Blue staining.

FIG. 16 is a graphical depiction of the survival
curve of Balb/c mice protected from S. pneumoniae
infection by immunization with HSP72rec. Data are
presented as the per cent (%) survival over a period of
14 days for a total of 10 mice per experimental group.

FIG. 17 is a graphical depiction of the survival
curve of Balb/c mice protected from S. pneumoniae
infection by immunization with C-169rec. Data are
presented as the per cent (%) survival over a period of 14
days for a total of 10 mice per experimental group.
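
The per cent survival plotted in such curves can be sketched as follows. This is a minimal illustration only: the function name, its parameters and the example death days are hypothetical and are not data from the experiments described here. It assumes each death is recorded as the day on which it occurred and that survival is reported at the end of each day.

```python
def survival_curve(death_days, group_size=10, last_day=14):
    """Per cent of a group still alive at the end of each day, day 0..last_day.

    death_days: iterable with one entry per animal that died, giving the day
    of death (hypothetical input format, assumed for this sketch).
    """
    death_days = list(death_days)
    if len(death_days) > group_size:
        raise ValueError("more deaths recorded than animals in the group")
    curve = []
    for day in range(last_day + 1):
        # Count every animal that has died on or before this day.
        dead = sum(1 for d in death_days if d <= day)
        curve.append(100.0 * (group_size - dead) / group_size)
    return curve


if __name__ == "__main__":
    # Invented example: four of ten animals die on days 2, 2, 3 and 5.
    for day, pct in enumerate(survival_curve([2, 2, 3, 5])):
        print(day, pct)
```

With 10 animals per group, each death lowers the curve by 10 percentage points, which is why such plots move in discrete steps.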

FIG. 18 is a map of plasmid pURV3 containing
C-151rec, the coding region for the 151 amino acids at the
carboxyl end of the HSP72 of S. pneumoniae; AmpR,
ampicillin-resistance coding region; ColE1 ori, origin of
replication; cI857, bacteriophage λ cI857
temperature-sensitive repressor gene; λ PL, bacteriophage λ
transcription promoter; T1, T1 transcription terminator.
The direction of transcription is indicated by the arrows.
BglII and BamHI are the restriction sites used to insert
the coding region for the C-151rec of the HSP72 of S.
pneumoniae.

FIG. 19 illustrates the
distribution of anti-S. pneumoniae titers in sera from
Balb/c mice immunized with HSP72rec. Sera were collected
after the first, second and third injection with 1 µg (○)
or 5 µg (●) of HSP72rec and evaluated individually for
anti-S. pneumoniae antibody by ELISA. Titers were defined
as the highest dilution at which the A410 values were 0.1
above the background values. Plain lines indicate the
median reciprocal of antibody titers for each group of
mice while the dashed line indicates the median value for
preimmune sera.
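
The titer definition above (highest dilution at which A410 is 0.1 above background, summarized per group by the median reciprocal titer) can be sketched as follows. This is an illustration under stated assumptions, not the authors' analysis procedure: the function names and the example readings are hypothetical, and "0.1 above the background" is read here as "at least 0.1 above".

```python
from statistics import median


def endpoint_titer(readings, background, margin=0.1):
    """Reciprocal of the highest dilution whose A410 is >= background + margin.

    readings: mapping of reciprocal dilution (e.g. 100 for 1:100) to A410.
    Returns None when no dilution reaches the cutoff.
    """
    cutoff = background + margin
    positive = [dilution for dilution, a410 in readings.items() if a410 >= cutoff]
    return max(positive) if positive else None


def group_median_titer(titers):
    """Median reciprocal titer of a group, ignoring sera with no titer."""
    return median(t for t in titers if t is not None)


if __name__ == "__main__":
    # Invented readings for one serum; background A410 assumed to be 0.05.
    serum = {100: 0.90, 400: 0.45, 1600: 0.20, 6400: 0.08}
    print(endpoint_titer(serum, background=0.05))
```

Because titers from serial dilutions span orders of magnitude, the median is a natural group summary: it is not pulled upward by a single high responder.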

FIG. 20 illustrates the distribution of anti-S.
pneumoniae titers in sera from Balb/c mice immunized with
C-169rec. Sera were collected after the first, second and
third injection with 1 µg (○) or 5 µg (●) of C-169rec and
evaluated individually for anti-S. pneumoniae antibody by
ELISA. Titers were defined as the highest dilution at
which the A410 values were 0.1 above the background
values. Plain lines indicate the median reciprocal of
antibody titers for each group of mice while the dashed
line indicates the median value for preimmune sera.

FIG. 21 illustrates the distribution of anti-S.
pneumoniae titers in sera from Balb/c mice immunized with
C-151rec. Sera were collected after the first, second and
third injection with 0.5 µg of C-151rec and evaluated
individually for anti-S. pneumoniae antibody by ELISA.
Titers were defined as the highest dilution at which the
A410 values were 0.1 above the background values. Plain
lines indicate the median reciprocal of antibody titers
for each group of mice while the dashed line indicates the
median value for preimmune sera.

FIG. 22 illustrates the antibody response of
cynomolgus monkeys immunized with recombinant HSP72
antigens. Groups of two monkeys were immunized with
either HSP72rec or C-169rec protein at day 1, day 22 and
day 77. Sera were collected regularly during the course of
the immunization and evaluated individually for
pneumococcal HSP72-specific antibody by Western blot
analysis. Titers were defined as the highest dilution at
which the HSP72 band was visualized.

FIG. 23 illustrates the binding of hyperimmune sera
to peptides in a solid-phase ELISA. Rabbit, mouse and
monkey sera from animals immunized with either HSP72rec or
C-169rec protein were tested for their reactivity to
peptides. Optical density values were obtained with sera
tested at a dilution of 1:100 except for the values
corresponding to the reactivity of rabbit sera to peptide
MAP2 and murine sera to peptides MAP2 and MAP4, which were
obtained with sera diluted 1:1000.

FIG. 24 depicts the consensus sequence established
from the DNA sequences of the hsp70/dnaK open reading
frames of Streptococcus pneumoniae (spn-orf),
Streptococcus pyogenes (sga-orf) and Streptococcus
agalactiae (sgb-orf) and indicates the substitutions and
insertions of nucleotides specific to each species.

FIG. 25 depicts the consensus sequence established
from the protein sequences of the Hsp70 of Streptococcus
pneumoniae (spn-prot), Streptococcus pyogenes (sga-prot)
and Streptococcus agalactiae (sgb-prot) and indicates the
substitutions and insertions of amino acids specific to
each species.
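
The idea of a consensus sequence with species-specific substitutions can be sketched as below. This is an illustrative majority-rule scheme, not necessarily the method used to prepare FIGS. 24 and 25: the function name, the "x" placeholder for unresolved columns and the toy sequences are all assumptions, and the sequences are invented rather than taken from the SEQ ID listings. The input is assumed to be already aligned, with insertions represented by gap characters.

```python
from collections import Counter


def consensus(aligned, unresolved="x"):
    """Majority-rule consensus plus per-sequence differences.

    aligned: mapping of label -> sequence, all pre-aligned to equal length
    (gap characters are treated as ordinary symbols).
    Returns (consensus, {label: [(position, symbol), ...]}), 1-based positions.
    """
    sequences = list(aligned.values())
    if not sequences:
        raise ValueError("no sequences given")
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("sequences must be aligned to equal length")

    symbols = []
    for column in zip(*sequences):
        symbol, count = Counter(column).most_common(1)[0]
        # Keep the symbol only if a strict majority of sequences share it.
        symbols.append(symbol if 2 * count > len(column) else unresolved)
    cons = "".join(symbols)

    differences = {
        label: [(i, s) for i, (s, c) in enumerate(zip(seq, cons), start=1) if s != c]
        for label, seq in aligned.items()
    }
    return cons, differences


if __name__ == "__main__":
    # Invented toy alignment; labels mirror the figure's naming only.
    toy = {"spn-prot": "MSKIIG", "sga-prot": "MSKVIG", "sgb-prot": "MAKIIG"}
    print(consensus(toy))
```

With three sequences, a strict majority means at least two agree; a column where all three differ is left unresolved rather than assigned arbitrarily.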

FIG. 26 depicts a fluorogram, which shows the effect
of heat shock on S. agalactiae protein synthesis and the
S. agalactiae protein antigen immunoprecipitated by MAb
F2-Pn3.4. Cell lysates from [35S]methionine-labeled
proteins from S. agalactiae grown at 37°C and incubated at
37°C (odd numbered lanes) or heat-shocked at 43°C (even
numbered lanes) were analysed by SDS-PAGE and
fluorography. Lanes 3 and 4 show the immunoprecipitates
obtained using MAb F2-Pn3.4.

DETAILED DESCRIPTION OF THE INVENTION 

According to one aspect of the invention, we
provide novel heat shock proteins of S. pneumoniae, S.
pyogenes and S. agalactiae, and analogues, homologues,
derivatives and fragments thereof, containing at least one
immunogenic epitope. As used herein, a "heat shock
protein" is a naturally occurring protein that exhibits
preferential transcription during heat stress conditions.
The heat shock protein according to the invention may be
of natural origin, or may be obtained through the
application of recombinant DNA techniques, or conventional
chemical synthesis techniques.

As used herein, "immunogenic" means having the
ability to elicit an immune response. The novel heat
shock proteins of this invention are characterized by
their ability to elicit a protective immune response
against Streptococcal infections, more particularly
against lethal S. pneumoniae, S. pyogenes and S.
agalactiae.

The invention particularly provides a
Streptococcus pneumoniae heat shock protein of approximately
72 kDa ("HSP72"), having the deduced amino acid sequence
of SEQ ID NO: 5, and analogues, homologues, derivatives and
fragments thereof, containing at least one immunogenic
epitope.

As used herein, "analogues" of HSP72 are those
S. pneumoniae proteins wherein one or more amino acid
residues in the HSP72 amino acid sequence (SEQ ID NO: 5) is
replaced by another amino acid residue, providing that the
overall functionality and immunogenic properties of the
analogue protein are preserved. Such analogues may be
naturally occurring, or may be produced synthetically or
by recombinant DNA technology, for example, by mutagenesis
of the HSP72 sequence. Analogues of HSP72 will possess at
least one antigen capable of eliciting antibodies that
react with HSP72, e.g., Streptococcus pyogenes and
Streptococcus agalactiae.

As used herein, "homologues" of HSP72 are
proteins from Streptococcal species other than pneumoniae,
pyogenes or agalactiae, or genera other than Streptococcus
wherein one or more amino acid residues in the HSP72 amino
acid sequence (SEQ ID NO: 5) is replaced by another amino
acid residue, providing that the overall functionality and
immunogenic properties of the homologue protein are
preserved. Such homologues may be naturally occurring, or
may be produced synthetically or by recombinant DNA
technology. Homologues of HSP72 will possess at least one
antigen capable of eliciting antibodies that react with
HSP72, e.g., Enterococcus faecalis.

As used herein, a "derivative" is a polypeptide
in which one or more physical, chemical, or biological
properties has been altered. Such alterations include,
but are not limited to: amino acid substitutions,
modifications, additions or deletions; alterations in the
pattern of lipidation, glycosylation or phosphorylation;
reactions of free amino, carboxyl, or hydroxyl side groups
of the amino acid residues present in the polypeptide with
other organic and non-organic molecules; and other
alterations, any of which may result in changes in
primary, secondary or tertiary structure.

The "fragments" of this invention will have at
least one immunogenic epitope. An "immunogenic epitope"
is an epitope that is instrumental in eliciting an immune
response. The preferred fragments of this invention will
elicit an immune response sufficient to prevent or lessen
the severity of infection, e.g., S. pneumoniae infection.
Preferred fragments of HSP72 include the C-terminal region
of the polypeptides. More preferred fragments include the
C-terminal 169-residue fragment ("C-169") (SEQ ID NO:5,
residues 439-607), the C-terminal 151-residue fragment
("C-151") (SEQ ID NO:5, residues 457-607) and smaller fragments
consisting of peptide epitopes within the C-169 region.
Particularly preferred fragments within the C-169 region
of HSP72 include the peptide sequences GFDAERDAAQAALDD
(residues 527-541 of SEQ ID NO:5) and AEGAQATGNAGDDW
(residues 586-600 of SEQ ID NO:5), which are exclusive to
HSP72 of Streptococcus pneumoniae, or corresponding
degenerate fragments from S. pyogenes or S. agalactiae
(see FIG. 25). Even more preferred are fragments that
elicit a specific immune reaction against Streptococcal
strains. Such fragments may be selected from the
following peptides: CS870, CS873, CS874, CS875, CS876,
CS877, CS878, CS879, CS880, CS882, MAP1, MAP2, MAP3 and
MAP4 (see TABLE 5, supra), or homologues thereof.
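The residue ranges quoted above use 1-based, inclusive numbering against SEQ ID NO:5. As a consistency check, the following sketch computes the length of each range and confirms that the shorter fragments lie within the C-169 region. The class and method names are hypothetical, and the 607-residue placeholder string merely stands in for SEQ ID NO:5, which is not reproduced here.

```python
from typing import NamedTuple


class Region(NamedTuple):
    """A residue range in 1-based, inclusive coordinates."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        return self.end - self.start + 1

    def extract(self, protein: str) -> str:
        """Return the residues of `protein` covered by this range."""
        if self.end > len(protein):
            raise ValueError(f"{self.name} extends past the end of the protein")
        return protein[self.start - 1:self.end]

    def contains(self, other: "Region") -> bool:
        """True if `other` lies entirely inside this range."""
        return self.start <= other.start and other.end <= self.end


C169 = Region("C-169", 439, 607)
C151 = Region("C-151", 457, 607)
PEPTIDE_A = Region("residues 527-541", 527, 541)
PEPTIDE_B = Region("residues 586-600", 586, 600)

for region in (C169, C151, PEPTIDE_A, PEPTIDE_B):
    print(region.name, region.length, C169.contains(region))

# Placeholder of the right length only; NOT the real HSP72 sequence.
placeholder = "X" * 607
print(len(C151.extract(placeholder)))
```

The computed lengths (169, 151, 15 and 15) agree with the fragment names. The second peptide as printed in this text has 14 letters against a 15-residue range, so it is worth checking against the sequence listing.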

In a further aspect of the invention, we provide
polypeptides that are immunologically related to HSP70/72.
As used herein, "immunologically related" polypeptides are
characterized by one or more of the following properties:

(a) they are immunologically reactive with
antibodies generated by infection of a mammalian host with
Streptococcus pneumoniae cells, which antibodies are
immunologically reactive with HSP72 (SEQ ID NO:5) and
HSP70 (SEQ ID NO:20 and SEQ ID NO:22);

(b) they are capable of eliciting antibodies that
are immunologically reactive with HSP72 (SEQ ID NO:5) and
HSP70 (SEQ ID NO:20 and SEQ ID NO:22);

(c) they are immunologically reactive with
antibodies elicited by immunization of a mammal with HSP72
(SEQ ID NO:5).

By definition, analogues, homologues and
derivatives of HSP70/72 are immunologically related
polypeptides. Moreover, all immunologically related
polypeptides contain at least one HSP70/72 antigen.
Accordingly, "HSP70/72 antigens" may be found in HSP70/72
itself, or in immunologically related polypeptides.

In a further aspect of the invention, we provide
polypeptides that are immunologically related to HSP72.
As used herein, "immunologically related" polypeptides are
characterized by one or more of the following properties:

(a) they are immunologically reactive with
antibodies generated by infection of a mammalian host with
Streptococcus pneumoniae cells, which antibodies are
immunologically reactive with HSP72 (SEQ ID NO:5);

(b) they are capable of eliciting antibodies that
are immunologically reactive with HSP72 (SEQ ID NO:5);

(c) they are immunologically reactive with
antibodies elicited by immunization of a mammal with HSP72
(SEQ ID NO:5).

By definition, analogues, homologues and
derivatives of HSP72 are immunologically related
polypeptides. Moreover, all immunologically related
polypeptides contain at least one HSP72 antigen.
Accordingly, "HSP72 antigens" may be found in HSP72
itself, or in immunologically related polypeptides.

As used herein, "related bacteria" are bacteria
that possess antigens capable of eliciting antibodies that
react with HSP72. Examples of related bacteria include
Streptococcus pneumoniae, Streptococcus pyogenes,
Streptococcus mutans, Streptococcus sanguis, Streptococcus
agalactiae and Enterococcus faecalis.

It will be understood that by following the
examples of this invention, one of skill in the art may
determine without undue experimentation whether a
particular analogue, homologue, derivative,
immunologically related polypeptide, or fragment would be
useful in the diagnosis, prevention or treatment of
disease. Useful polypeptides and fragments will elicit
antibodies that are immunoreactive with HSP72 (Example 4).
Preferably, useful polypeptides and fragments will
demonstrate the ability to elicit a protective immune
response against lethal bacterial infection (Example 5).

Also included are polymeric forms of the
polypeptides of this invention. These polymeric forms
include, for example, one or more polypeptides that have
been crosslinked with crosslinkers such as avidin/biotin,
glutaraldehyde or dimethylsuberimidate. Such polymeric
forms also include polypeptides containing two or more
tandem or inverted contiguous protein sequences, produced
from multicistronic mRNAs generated by recombinant DNA
technology.

This invention provides substantially pure HSP72
and immunologically related polypeptides. The term
"substantially pure" means that the polypeptides according
to the invention, and the DNA sequences encoding them, are
substantially free from other proteins of bacterial
origin. Substantially pure protein preparations may be
obtained by a variety of conventional processes, for
example the procedures described in Examples 3 and 5.

In another aspect, this invention provides, for
the first time, a DNA sequence coding for a heat shock
protein of S. pneumoniae, specifically, HSP72 (SEQ ID
NO:4, nucleotides 682-2502).
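The nucleotide range given for the HSP72 coding sequence can be checked against the protein numbering used elsewhere in this specification. The sketch below (the function name is hypothetical) treats the range as 1-based and inclusive and converts it into a number of codons.

```python
def codon_count(first_nt: int, last_nt: int) -> int:
    """Number of complete codons in a 1-based, inclusive nucleotide range."""
    length = last_nt - first_nt + 1
    if length % 3 != 0:
        raise ValueError(f"range of {length} nt is not a whole number of codons")
    return length // 3


print(codon_count(682, 2502))  # 1821 nt -> 607 codons
```

The result, 607 codons, matches residue 607, which is the last residue cited for the C-terminal fragments of SEQ ID NO:5. Whether the quoted range is meant to include a stop codon is not stated in this passage.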

The DNA sequences of this invention also include
DNA sequences coding for polypeptide analogues and
homologues of HSP72, DNA sequences coding for
immunologically related polypeptides, DNA sequences that
are degenerate to any of the foregoing DNA sequences, and
fragments of any of the foregoing DNA sequences. It will
be readily appreciated that a person of ordinary skill in
the art will be able to determine the DNA sequence of any
of the polypeptides of this invention, once the
polypeptide has been identified and isolated, using
conventional DNA sequencing techniques.
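To give a sense of how large the set of degenerate DNA sequences can be, the following sketch counts the distinct coding sequences for a peptide. It assumes the standard genetic code (an assumption made for this illustration, not a statement from the specification) and uses the number of synonymous codons per amino acid; the function name is hypothetical.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def encoding_count(peptide: str) -> int:
    """Count the DNA sequences that encode `peptide` (stop codon excluded)."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


# Residues 527-541 of SEQ ID NO:5 as quoted in this specification.
print(encoding_count("GFDAERDAAQAALDD"))  # 18874368
```

Even this 15-residue peptide has more than eighteen million possible coding sequences under the assumed code, which is why degenerate sequences are covered as a class rather than listed individually.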

Oligonucleotide primers and other nucleic acid
probes derived from the genes encoding the polypeptides of
this invention may also be used to isolate and clone other
related proteins from S. pneumoniae and related bacteria
which may contain regions of DNA that are
homologous to the DNA sequences of this invention. In
addition, the DNA sequences of this invention may be used
in PCR reactions to detect the presence of S. pneumoniae
or related bacteria in a biological sample.

The polypeptides of this invention may be
prepared by a variety of processes, for example by
protein fractionation from appropriate cell extracts,
using conventional separation techniques such as ion
exchange and gel chromatography and electrophoresis, or by
the use of recombinant DNA techniques. The use of
recombinant DNA techniques is particularly suitable for
preparing substantially pure polypeptides according to the
invention.

Thus according to a further aspect of the
invention, we provide a process for the production of
HSP72, immunologically related polypeptides, and fragments
thereof, comprising the steps of (1) culturing a
unicellular host organism transformed with a vector
containing a DNA sequence coding for said polypeptide or
fragment and one or more expression control sequences
operatively linked to the DNA sequence, and (2) recovering
a substantially pure polypeptide or fragment.

As is well known in the art, in order to obtain
high expression levels of a transfected gene in a host,
the gene must be operatively linked to transcriptional and
translational expression control sequences that are
functional in the chosen expression host. Preferably, the
expression control sequences, and the gene of interest,
will be contained in an expression vector that further
comprises a bacterial selection marker and origin of
replication. If the expression host is a eukaryotic cell,
the expression vector should further comprise an
expression marker useful in the eukaryotic expression
host.

The DNA sequences encoding the polypeptides of
this invention may or may not encode a signal sequence.
If the expression host is eukaryotic, it generally is
preferred that a signal sequence be encoded so that the
mature protein is secreted from the eukaryotic host.

An amino terminal methionine may or may not be
present on the expressed polypeptides of this invention.
If the terminal methionine is not cleaved by the
expression host, it may, if desired, be chemically removed
by standard techniques.

A wide variety of expression host/vector
combinations may be employed in expressing the DNA
sequences of this invention. Useful expression vectors
for eukaryotic hosts include, for example, vectors
comprising expression control sequences from SV40, bovine
papilloma virus, adenovirus, adeno-associated virus,
cytomegalovirus, and retroviruses. Useful expression
vectors for bacterial hosts include bacterial plasmids,
such as those from E. coli, including pBluescript, pGEX2T,
pUC vectors, col E1, pCR1, pBR322, pMB9 and their
derivatives, wider host range plasmids, such as RP4, phage
DNAs, e.g., the numerous derivatives of phage lambda, e.g.
λgt10 and λgt11, NM989, and other DNA phages, such as M13
and filamentous single stranded DNA phages. Useful
expression vectors for yeast cells include the 2μ plasmid
and derivatives thereof. Useful vectors for insect cells
include pVL 941.

In addition, any of a wide variety of expression
control sequences may be used in these vectors to express
the DNA sequences of this invention. Useful expression
control sequences include the expression control sequences
associated with structural genes of the foregoing
expression vectors. Examples of useful expression control
sequences include, for example, the early and late
promoters of SV40 or adenovirus, the lac system, the trp
system, the TAC or TRC system, the T3 and T7 promoters, the
major operator and promoter regions of phage lambda, the
control regions of fd coat protein, the promoter for 3-
phosphoglycerate kinase or other glycolytic enzymes, the
promoters of acid phosphatase, e.g., Pho5, the promoters
of the yeast alpha-mating system and other constitutive
and inducible promoter sequences known to control the
expression of genes of prokaryotic or eukaryotic cells or
their viruses, and various combinations thereof. The T7
RNA polymerase promoter φ10 is particularly useful in the
expression of HSP72 in E. coli (Example 3).

Host cells transformed with the foregoing
vectors form a further aspect of this invention. A wide
variety of unicellular host cells are useful in expressing
the DNA sequences of this invention. These hosts may
include well known eukaryotic and prokaryotic hosts, such
as strains of E. coli, Pseudomonas, Bacillus,
Streptomyces, fungi, yeast, insect cells such as
Spodoptera frugiperda (SF9), animal cells such as CHO and
mouse cells, African green monkey cells such as COS 1, COS
7, BSC 1, BSC 40, and BMT 10, human cells, and plant cells
in tissue culture. Preferred host organisms include
bacteria such as E. coli and B. subtilis, and mammalian
cells in tissue culture.

It should of course be understood that not all
vectors and expression control sequences will function
equally well to express the DNA sequences of this
invention. Neither will all hosts function equally well
with the same expression system. However, one of skill in
the art may make a selection among these vectors,
expression control sequences and hosts without undue
experimentation and without departing from the scope of
this invention. For example, in selecting a vector, the
host must be considered because the vector must replicate
in it. The vector's copy number, the ability to control
that copy number, and the expression of any other proteins
encoded by the vector, such as antibiotic markers, should
also be considered. In selecting an expression control
sequence, a variety of factors should also be considered.
These include, for example, the relative strength of the
sequence, its controllability, and its compatibility with
the DNA sequences of this invention, particularly as
regards potential secondary structures. Unicellular hosts
should be selected by consideration of their compatibility
with the chosen vector, the toxicity of the product coded
for by the DNA sequences of this invention, their
secretion characteristics, their ability to fold the
protein correctly, their fermentation or culture
requirements, and the ease of purification from them of
the products coded for by the DNA sequences of this
invention. Within these parameters, one of skill in the
art may select various vector/expression control
sequence/host combinations that will express the DNA
sequences of this invention on fermentation or in large
scale animal culture.

The polypeptides encoded by the DNA sequences of
this invention may be isolated from the fermentation or
cell culture and purified using any of a variety of
conventional methods including: liquid chromatography such
as normal or reversed phase, using HPLC, FPLC and the
like; affinity chromatography (such as with inorganic
ligands or monoclonal antibodies); size exclusion
chromatography; immobilized metal chelate chromatography;
gel electrophoresis; and the like. One of skill in the
art may select the most appropriate isolation and
purification techniques without departing from the scope
of this invention.

In addition, the polypeptides of this invention
may be generated by any of several chemical techniques.
For example, they may be prepared using the solid-phase
synthetic technique originally described by R. B.
Merrifield, "Solid Phase Peptide Synthesis. I. The
Synthesis Of A Tetrapeptide", J. Am. Chem. Soc., 85,
pp. 2149-54 (1963), or they may be prepared by synthesis
in solution. A summary of peptide synthesis techniques
may be found in E. Gross & H. J. Meienhofer, 4 The
Peptides: Analysis, Synthesis, Biology; Modern Techniques
Of Peptide And Amino Acid Analysis, John Wiley & Sons,
(1981) and M. Bodanszky, Principles Of Peptide Synthesis,
Springer-Verlag (1984).

The preferred compositions and methods of this
invention comprise polypeptides having enhanced
immunogenicity. Such polypeptides may result when the
native forms of the polypeptides or fragments thereof are
modified or subjected to treatments to enhance their
immunogenic character in the intended recipient.
Preferred polypeptides are fragments that are specific to
Streptococcal species such as fragments selected from the
C-terminal portion of the native polypeptides. Numerous
techniques are available and well known to those of skill
in the art which may be used, without undue
experimentation, to substantially increase the
immunogenicity of the polypeptides herein disclosed. For
example, the polypeptides may be modified by coupling to
dinitrophenol groups or arsanilic acid, or by denaturation
with heat and/or SDS. Particularly if the polypeptides
are small polypeptides synthesized chemically, it may be
desirable to couple them to an immunogenic carrier. The
coupling, of course, must not interfere with the ability of
either the polypeptide or the carrier to function
appropriately. For a review of some general
considerations in coupling strategies, see Antibodies, A
Laboratory Manual, Cold Spring Harbor Laboratory, ed. E.
Harlow and D. Lane (1988). Useful immunogenic carriers
are well known in the art. Examples of such carriers are
keyhole limpet hemocyanin (KLH); albumins such as bovine
serum albumin (BSA) and ovalbumin, PPD (purified protein
derivative of tuberculin); red blood cells; tetanus
toxoid; cholera toxoid; agarose beads; activated carbon;
or bentonite.



Modification of the amino acid sequence of the
polypeptides disclosed herein in order to alter the
lipidation state is also a method which may be used to
increase their immunogenicity and biochemical properties.
For example, the polypeptides or fragments thereof may be
expressed with or without the signal sequences that direct
addition of lipid moieties.

In accordance with this invention, derivatives
of the polypeptides may be prepared by a variety of
methods, including by in vitro manipulation of the DNA
encoding the native polypeptides and subsequent expression
of the modified DNA, by chemical synthesis of derivatized
DNA sequences, or by chemical or biological manipulation
of expressed amino acid sequences.

For example, derivatives may be produced by
substitution of one or more amino acids with a different
natural amino acid, an amino acid derivative or non-native
amino acid, conservative substitution being preferred,
e.g., 3-methylhistidine may be substituted for histidine,
4-hydroxyproline may be substituted for proline,
5-hydroxylysine may be substituted for lysine, and the like.

Causing amino acid substitutions which are less
conservative may also result in desired derivatives, e.g.,
by causing changes in charge, conformation and other
biological properties. Such substitutions would include,
for example, substitution of a hydrophilic residue for a
hydrophobic residue, substitution of a cysteine or proline
for another residue, substitution of a residue having a
small side chain for a residue having a bulky side chain
or substitution of a residue having a net positive charge
for a residue having a net negative charge. When the
result of a given substitution cannot be predicted with
certainty, the derivatives may be readily assayed
according to the methods disclosed herein to determine the
presence or absence of the desired characteristics.

The polypeptides may also be prepared with the
objective of increasing stability or rendering the
molecules more amenable to purification and preparation.
One such technique is to express the polypeptides as
fusion proteins comprising other S. pneumoniae or
non-S. pneumoniae sequences. It is preferred that the fusion
proteins comprising the polypeptides of this invention be
produced at the DNA level, e.g., by constructing a nucleic
acid molecule encoding the fusion, transforming host cells
with the molecule, inducing the cells to express the
fusion protein, and recovering the fusion protein from the
cell culture. Alternatively, the fusion proteins may be
produced after gene expression according to known methods.
An example of a fusion protein according to this invention
is the FUCI/HSP72 (C-169) protein of Example 3, infra.

The polypeptides of this invention may also be
part of larger multimeric molecules which may be produced
recombinantly or may be synthesized chemically. Such
multimers may also include the polypeptides fused or
coupled to moieties other than amino acids, including
lipids and carbohydrates.

The polypeptides of this invention are
particularly well-suited for the generation of antibodies
and for the development of a protective response against
disease. Accordingly, in another aspect of this
invention, we provide antibodies, or fragments thereof,
that are immunologically reactive with HSP72. The
antibodies of this invention are either elicited by
immunization with HSP72 or an immunologically related
polypeptide, or are identified by their reactivity with
HSP72 or an immunologically related polypeptide. It
should be understood that the antibodies of this invention
are not intended to include those antibodies which are
normally elicited in an animal upon infection with
naturally occurring S. pneumoniae and which have not been
removed from or altered within the animal in which they
were elicited.

The antibodies of this invention may be intact
immunoglobulin molecules or fragments thereof that contain
an intact antigen binding site, including those fragments
known in the art as F(v), Fab, Fab' and F(ab')2. The
antibodies may also be genetically engineered or
synthetically produced. The antibody or fragment may be
of animal origin, specifically of mammalian origin, and
more specifically of murine, rat, monkey or human origin.
It may be a natural antibody or fragment, or if desired, a
recombinant antibody or fragment. The antibody or
antibody fragments may be of polyclonal, or preferably, of
monoclonal origin. They may be specific for a number of
epitopes but are preferably specific for one.
Specifically preferred are the monoclonal antibodies
F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 of Example 2,
infra. One of skill in the art may use the polypeptides
of this invention to produce other monoclonal antibodies
which could be screened for their ability to confer
protection against S. pneumoniae, S. pyogenes, S.
agalactiae or other Streptococcal related bacterial
infection when used to immunize naive animals. Once a
given monoclonal antibody is found to confer protection,
the particular epitope that is recognized by that antibody
may then be identified. Methods to produce polyclonal and
monoclonal antibodies are well known to those of skill in
the art. For a review of such methods, see Antibodies, A
Laboratory Manual, supra, and D.E. Yelton, et al., Ann.
Rev. of Biochem., 50, pp. 657-80 (1981). Determination of
immunoreactivity with a polypeptide of this invention may
be made by any of several methods well known in the art,
including by immunoblot assay and ELISA.

An antibody of this invention may also be a
hybrid molecule formed from immunoglobulin sequences from
different species (e.g., mouse and human) or from portions
of immunoglobulin light and heavy chain sequences from the
same species. It may be a molecule that has multiple
binding specificities, such as a bifunctional antibody
prepared by any one of a number of techniques known to
those of skill in the art including: the production of
hybrid hybridomas; disulfide exchange; chemical
cross-linking; addition of peptide linkers between two
monoclonal antibodies; the introduction of two sets of
immunoglobulin heavy and light chains into a particular
cell line; and so forth. The antibodies of this
invention may also be human monoclonal antibodies, for
example those produced by immortalized human cells, by
SCID-hu mice or other non-human animals capable of
producing "human" antibodies, or by the expression of
cloned human immunoglobulin genes.

In sum, one of skill in the art, provided with
the teachings of this invention, has available a variety
of methods which may be used to alter the biological
properties of the antibodies of this invention including
methods which would increase or decrease the stability or
half-life, immunogenicity, toxicity, affinity or yield of
a given antibody molecule, or to alter it in any other way
that may render it more suitable for a particular
application.

The polypeptides, DNA sequences and antibodies
of this invention are useful in prophylactic, therapeutic
and diagnostic compositions for preventing, treating and
diagnosing disease.

Standard immunological techniques may be
employed with the polypeptides and antibodies of this
invention in order to use them as immunogens and as
vaccines. In particular, any suitable host may be
injected with a pharmaceutically effective amount of
polypeptide to generate monoclonal or polyvalent
antibodies or to induce the development of a protective
immunological response against disease. Preferably, the
polypeptide is selected from the group consisting of
HSP72 (SEQ ID NO:5), HSP70 (SEQ ID NO:20 and SEQ ID NO:22)
or fragments thereof.

As used herein, a "pharmaceutically effective
amount" of a polypeptide or of an antibody is the amount
that, when administered to a patient, elicits an immune
response that is effective to prevent or lessen the
severity of Streptococcal or related bacterial
infections.

The administration of the polypeptides or
antibodies of this invention may be accomplished by any of
the methods described in Example 10, infra, or by a
variety of other standard procedures. For a detailed
discussion of such techniques, see Antibodies, A
Laboratory Manual, Cold Spring Harbor Laboratory, ed.
E. Harlow and D. Lane (1988). Preferably, if a
polypeptide is used, it will be administered with a
pharmaceutically acceptable adjuvant, such as complete or
incomplete Freund's adjuvant, RIBI (muramyl dipeptides) or
ISCOM (immunostimulating complexes). Preferably, the
composition will include a water-in-oil emulsion or
aluminum hydroxide as adjuvant and will be administered
intramuscularly. The vaccine composition may be
administered to the patient at one time or over a series
of treatments. The most effective mode of administration
and dosage regimen will depend upon the level of
immunogenicity, the particular composition and/or adjuvant
used for treatment, the severity and course of the
expected infection, previous therapy, the patient's health
status and response to immunization, and the judgment of
the treating physician. For example, in an
immunocompetent patient, the more highly immunogenic the
polypeptide, the lower the dosage and necessary number of
immunizations. Similarly, the dosage and necessary
treatment time will be lowered if the polypeptide is
administered with an adjuvant.

Generally, the dosage will consist of an initial
injection, most probably with adjuvant, of about 0.01 to
10 mg, and preferably 0.1 to 1.0 mg, HSP72 antigen per
patient, followed most probably by one or more
booster injections. Preferably, boosters will be
administered at about 1 and 6 months after the initial
injection.

Any of the polypeptides of this invention may be
used in the form of a pharmaceutically acceptable salt.
Suitable acids and bases which are capable of forming
salts with the polypeptides of the present invention are
well known to those of skill in the art, and include
inorganic and organic acids and bases.

To screen the polypeptides and antibodies of
this invention for their ability to confer protection
against diseases caused by S. pneumoniae or related
bacteria, or their ability to lessen the severity of such
infection, one of skill in the art will recognize that a
number of animal models may be used. Any animal that is
susceptible to infection with S. pneumoniae or related
bacteria may be useful. The Balb/c mice of Example 5,
infra, are the preferred animal model for active
immunoprotection screening, and the severe-combined
immunodeficient mice of Example 5 are the preferred animal
model for passive screening. Thus, by administering a
particular polypeptide or antibody to these animal models,
one of skill in the art may determine without undue
experimentation whether that polypeptide or antibody would
be useful in the methods and compositions claimed herein.

According to another embodiment of this 
invention, we describe a method which comprises the steps 
of treating a patient with a vaccine comprising a 
pharmaceutically effective amount of any of the 
polypeptides of this invention in a manner sufficient to 
prevent or lessen the severity, for some period of time, 
of Streptococcal or related bacterial infection. Again, 
the preferred polypeptide for use in such methods is 
HSP70/HSP72, or fragments thereof. 

The polypeptides, DNA sequences and antibodies
of this invention may also form the basis for diagnostic
methods and kits for the detection of pathogenic
organisms. Several diagnostic methods are possible. For
example, this invention provides a method for the
detection of Streptococcus pneumoniae, Streptococcus
pyogenes, Streptococcus agalactiae or related bacteria in
a biological sample comprising the steps of:

(a) isolating the biological sample from a
patient;

(b) incubating an antibody of this invention,
or fragment thereof, with the biological sample to form a
mixture; and

(c) detecting specifically bound antibody or
fragment in the mixture which indicates the presence of
Streptococcus pneumoniae, Streptococcus pyogenes,
Streptococcus agalactiae or related bacteria.

Preferred antibodies for use in this method include monoclonal
antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4.

Alternatively, this invention provides a method
for the detection of antibodies specific to Streptococcus
pneumoniae or related bacteria in a biological sample
comprising:

(a) isolating the biological sample from a
patient;

(b) incubating a polypeptide of this invention
or fragment thereof, with the biological sample to form a
mixture; and

(c) detecting specifically bound polypeptide in
the mixture which indicates the presence of antibodies
specific to Streptococcus pneumoniae or related bacteria.

HSP72 (SEQ ID NO:5), the C-169 fragment thereof (residues
439-607 of SEQ ID NO:5), the C-151 fragment thereof
(residues 457-607 of SEQ ID NO:5) and peptide fragments
GFDAERDAAQAALDD (residues 527-541 of SEQ ID NO:5) and
AEGAQATGNAGDDW (residues 586-600 of SEQ ID NO:5) are the
preferred polypeptide and fragments in the above method
for the detection of antibodies.

One of skill in the art will recognize that
these diagnostic tests may take several forms, including
an enzyme-linked immunosorbent assay (ELISA), a
radioimmunoassay or a latex agglutination assay.

The diagnostic agents may be included in a kit
which may also comprise instructions for use and other
appropriate reagents, preferably a means for detecting
when the polypeptide or antibody is bound. For example,
the polypeptide or antibody may be labeled with a
detection means that allows for the detection of the
polypeptide when it is bound to an antibody, or for the
detection of the antibody when it is bound to
S. pneumoniae or related bacteria. The detection means
may be a fluorescent labeling agent such as fluorescein
isocyanate (FIC), fluorescein isothiocyanate (FITC), and
the like, an enzyme, such as horseradish peroxidase (HRP),
glucose oxidase or the like, a radioactive element such as
radioactive iodine (I) or ⁵¹Cr that produces gamma ray
emissions, or a radioactive element that emits positrons
which produce gamma rays upon encounters with electrons
present in the test solution, such as ¹¹C, ¹⁵O, or ¹³N.
Binding may also be detected by other methods, for example
via avidin-biotin complexes. The linking of the detection
means is well known in the art. For instance, monoclonal antibody
molecules produced by a hybridoma may be metabolically
labeled by incorporation of radioisotope-containing amino
acids in the culture medium, or polypeptides may be
conjugated or coupled to a detection means through
activated functional groups.

The DNA sequences of this invention may be used
to design DNA probes for use in detecting the presence of
Streptococcus pneumoniae or related bacteria in a
biological sample. The probe-based detection method of
this invention comprises the steps of:

(a) isolating the biological sample from a
patient;

(b) incubating a DNA probe having a DNA
sequence of this invention with the biological sample to
form a mixture; and

(c) detecting specifically bound DNA probe in
the mixture which indicates the presence of Streptococcus
pneumoniae or related bacteria.

The DNA probes of this invention may also be
used for detecting circulating nucleic acids in a sample,
for example using a polymerase chain reaction, as a method
of diagnosing Streptococcus pneumoniae or related
bacterial infections. The probes may be synthesized using
conventional techniques and may be immobilized on a solid
phase, or may be labeled with a detectable label. A
preferred DNA probe for this application is an oligomer
having a sequence complementary to at least about 6
contiguous nucleotides of HSP72 (SEQ ID NO:4, nucleotides
682-2502).
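
The notion of an oligomer complementary to a run of contiguous nucleotides can be sketched as follows. This is an illustrative Python sketch only: the target string is a made-up placeholder, not SEQ ID NO:4, and the function names and the window-scanning approach are assumptions introduced here rather than a procedure taken from the specification.

```python
# Map each DNA base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(target, length=6):
    """List (1-based start, probe) pairs for every window of `length`
    contiguous nucleotides; each probe is complementary to its window."""
    if length < 6:
        raise ValueError("the text recites at least about 6 nucleotides")
    return [
        (i + 1, reverse_complement(target[i:i + length]))
        for i in range(len(target) - length + 1)
    ]


# Hypothetical placeholder sequence, used only to exercise the functions.
example_target = "ATGGCTAAAGAA"
probes = candidate_probes(example_target, length=6)
print(probes[0])
```

In practice a probe would be far longer than 6 nucleotides to be specific; the minimum is used here only because it is the figure recited in the text.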

The polypeptides of this invention may also be
used to purify antibodies directed against epitopes
present on the protein, for example, using immunoaffinity
purification of antibodies on an antigen column.

The antibodies or antibody fragments of this
invention may be used to prepare substantially pure
proteins according to the invention for example, using
immunoaffinity purification of antibodies on an antigen
column.

EXAMPLES

In order that this invention may be better
understood, the following examples are set forth. These
examples are for purposes of illustration only, and are
not to be construed as limiting the scope of the invention
in any manner.

Example 1 describes the identification of HSP72,
an immunoreactive heat shock protein according to the
invention. Example 2 describes the isolation of
monoclonal antibodies against epitopes of HSP72. Example
3 describes the preparation of recombinant HSP72 and
fragments of HSP72 according to the invention. Example 4
describes the antigenic specificity and immunoreactivity
of monoclonal antibodies directed against HSP72, and the
identification of immunologically related proteins
according to the invention. Example 5 describes processes
for obtaining substantially pure HSP72, and the use of
HSP72 or antibodies against it to protect against
experimental S. pneumoniae infection. Example 6 describes
the preparation of recombinant C-151 fragment of HSP72
according to the invention. Example 7 describes the
humoral immune response following the immunization with
recombinant HSP72 or fragments of HSP72 according to the
invention. Example 8 describes the localization of linear
B-cell epitopes on the HSP72. Example 9 describes the
hsp70 genes and HSP70 proteins from S. agalactiae and S.
pyogenes. Example 10 describes the use of HSP72 antigen
in a human vaccine.

EXAMPLE 1 - Identification of Immunoreactive 
S. pneumoniae Heat Shock Proteins 

A. Procedures

Unless otherwise noted, the following procedures
were used throughout the Examples herein.

1. Bacteria

S. pneumoniae strains were provided by the
Laboratoire de la Sante Publique du Quebec, Sainte-Anne de
Bellevue. S. pneumoniae strains included type 4 strain 53
and type 6 strain 64. If not specified, S. pneumoniae
type 6 strain 64 was used. Bacterial strains were grown
overnight at 37°C in 5% CO2 on chocolate agar plates.

2. Antigen Preparations

Various S. pneumoniae antigens were prepared for
immunization and immunoassays. Heat-killed whole cell
antigens were obtained by incubating bacterial suspensions
in a water bath prewarmed at 56°C for 20 minutes.
Detergent-soluble proteins were extracted from
S. pneumoniae as follows. Heat-killed bacteria were
suspended in 10 mM Hepes buffer
(4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid)
(Boehringer Mannheim GmbH, Germany) at pH 7.4 and
sonicated at 20,000 Hz, four times for 30 seconds.
Intact cells and large debris were removed by
centrifugation at 1,700 g for 20 minutes. The
supernatant was collected and centrifuged at 100,000 g for
60 minutes. The pellet was resuspended in 1 ml of Hepes
buffer, and 1 ml of 2% N-lauroyl sarcosine (Sigma Chemical
Co., St. Louis, Mo.) was added. The mixture was incubated
for 30 minutes at room temperature and the
detergent-soluble fraction was harvested by centrifugation
at 100,000 g for 60 minutes.
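
The centrifugation steps above are stated as relative centrifugal force (g), whereas a centrifuge is often set in revolutions per minute. The standard relation RCF = 1.118 × 10⁻⁵ × r × N² (r in cm, N in rpm) converts between the two. The Python sketch below is illustrative only: the rotor radius is a hypothetical value chosen for the example, since the text does not name a rotor, and the function names are assumptions.

```python
import math

# Standard conversion constant: RCF = 1.118e-5 * radius_cm * rpm**2
RCF_CONSTANT = 1.118e-5


def rpm_for_rcf(rcf_g, radius_cm):
    """Rotor speed (rpm) needed to reach a given RCF at a given radius."""
    return math.sqrt(rcf_g / (RCF_CONSTANT * radius_cm))


def rcf_for_rpm(rpm, radius_cm):
    """RCF (x g) produced by a given rotor speed at a given radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


# Hypothetical rotor radius; the specification does not state one.
ASSUMED_RADIUS_CM = 8.0

for rcf in (1_700, 100_000):  # the two forces recited in the protocol
    print(f"{rcf} g -> {rpm_for_rcf(rcf, ASSUMED_RADIUS_CM):.0f} rpm")
```

The actual setting depends on the rotor used, so the radius should be taken from the rotor's documentation.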

3. Heat Shock Treatment

S. pneumoniae bacteria (type 4, strain 53 and
type 6, strain 64) were resuspended in Eagle's Minimal
Essential Medium lacking methionine (ICN Biomedicals Inc.,
Costa Mesa, CA) and supplemented with 1% BIO-X® (Quelab
Laboratories, Montreal, Canada) for 15 minutes at 37°C and
then divided into fractions of equal volume. The samples
were incubated at either 37°C or 45°C for 5 minutes and
then labeled with 100 µCi/ml [³⁵S]methionine (ICN) for 10,
30, or 60 minutes at 37°C. The bacteria were harvested and
cell extracts were prepared using Tris-HCl lysis buffer as
described above, or SDS-PAGE sample buffer.

4. Immunization Of Mice

Female Balb/c mice (Charles River Laboratories,
St-Constant, Quebec, Canada) were immunized with
S. pneumoniae antigens. Immune sera to S. pneumoniae
type 6 strain 64 were obtained from mice immunized, at
two-week intervals, by subcutaneous injections of 10^
heat-killed bacteria or 20 µg of detergent-soluble
pneumococcal proteins adsorbed to aluminum hydroxide
adjuvant (Alhydrogel®; Cedarlane Laboratories Ltd., Hornby,
Ontario, Canada). Blood samples were collected prior to
immunization and at seven days following the first and
second immunization.

5. SDS-PAGE and Immunoassays

Cell extracts were prepared for SDS-PAGE,
Western blot analysis and radioimmunoprecipitation assay
by incubating bacterial suspensions in Tris-HCl lysis
buffer (50 mM Tris, 150 mM NaCl, 0.1% Na dodecyl sulfate,
0.5% Na deoxycholate, 2% Triton® X-100, 100 µg/ml
phenylmethylsulfonylfluoride, and 2 µg/ml aprotinin) at pH
8.0 for 30 minutes on ice. Lysed cells were cleared by
centrifugation and the supernatants were aliquoted and
kept frozen at -70°C.
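
To make the lysis buffer composition concrete, the amounts needed for a given volume can be worked out as in the Python sketch below. It rests on stated assumptions rather than on the specification: percentages are read as w/v for the solid detergents and v/v for Triton X-100, the aprotinin concentration is read as 2 µg/ml, the molar masses are those of Tris base and NaCl, and the function name is hypothetical. Adjustment to pH 8.0 is not modelled.

```python
TRIS_MW = 121.14  # g/mol, Tris base
NACL_MW = 58.44   # g/mol


def lysis_buffer_recipe(volume_ml):
    """Amounts of each component for `volume_ml` of Tris-HCl lysis buffer.

    Assumes % w/v for solids and % v/v for Triton X-100 (an interpretation,
    not stated in the text).
    """
    litres = volume_ml / 1000.0
    return {
        "Tris (g)": 0.050 * litres * TRIS_MW,               # 50 mM
        "NaCl (g)": 0.150 * litres * NACL_MW,               # 150 mM
        "Na dodecyl sulfate (g)": 0.1 / 100 * volume_ml,    # 0.1% w/v
        "Na deoxycholate (g)": 0.5 / 100 * volume_ml,       # 0.5% w/v
        "Triton X-100 (ml)": 2.0 / 100 * volume_ml,         # 2% v/v
        "PMSF (mg)": 100 * volume_ml / 1000.0,              # 100 ug/ml
        "aprotinin (mg)": 2 * volume_ml / 1000.0,           # 2 ug/ml
    }


recipe = lysis_buffer_recipe(100)
for component, amount in recipe.items():
    print(f"{component}: {amount:.4g}")
```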

SDS-PAGE was performed on a 10% polyacrylamide
gel according to the method of Laemmli [Nature, 227,
pp. 680-685 (1970)], using the Mini Protean® system
(Bio-Rad Laboratories Ltd., Mississauga, Canada). Samples
were denatured by boiling for 5 minutes in sample buffer
containing 2% 2-mercaptoethanol. Proteins were visualized
by staining the polyacrylamide gel with PhastGel Blue®
(Pharmacia Biotech Inc., Baie d'Urfe, Canada). The
radiolabeled products were visualized by fluorography.
Fluorograms were scanned using a laser densitometer.

Immunoblot procedures were performed according
to the method of Towbin et al. [Proc. Natl. Acad. Sci.
USA, 76, pp. 4350-4354 (1979)]. The detection of antigens
reactive with antibodies was performed by an indirect
antibody immunoassay using peroxidase-labeled anti-mouse
immunoglobulins and the o-dianisidine color substrate.

Radioimmunoprecipitation assays were performed
as described by J. A. Wiley et al. [J. Virol., 66,
pp. 5744-5751 (1992)]. Briefly, sera or hybridoma culture
supernatants were added to radiolabeled samples containing
equal amounts of [³⁵S]methionine. The mixtures were
allowed to incubate for 90 minutes at 4°C with constant
agitation. The immune complexes were then precipitated
with bovine serum albumin-treated protein A Sepharose
(Pharmacia) for 1 hour at 4°C. The beads were pelleted
and washed three times in Tris buffered saline at pH 8.0,
and the antigen complexes were then dissociated by boiling
in sample buffer. The antigens were analyzed by
electrophoresis on SDS-PAGE. The gels were fixed,
enhanced for fluorography using Amplify® (Amersham Canada
Limited, Oakville, Ontario, Canada), dried, and then
exposed to X-ray film.

B. Characterisation of the Heat
Shock Response in S. pneumoniae

We studied the heat shock response of
S. pneumoniae by examining the pattern of protein
synthesis before and after a shift from 37°C to 45°C.
FIG. 1 shows the results when S. pneumoniae type 6 strain
64 (panel A) and type 4 strain 53 (panel B) were grown at
37°C, incubated at 37°C (lanes 1, 3, 5, 7 and 9) or at 45°C
(lanes 2, 4, 6, 8 and 10) for 5 minutes, and then labeled
with [³⁵S]methionine for 10 minutes (lanes 1, 2 and 7, 8),
30 minutes (lanes 3, 4 and 9, 10), or 60 minutes
(lanes 5, 6).
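
The lane layout recited for FIG. 1 can be restated as a small lookup, which may help when reading the figure. The Python below is a minimal sketch; the dictionary and function names are assumptions introduced for illustration, and only the lane numbers, temperatures and labeling times come from the text.

```python
# Labeling time in minutes -> lanes, as recited for FIG. 1.
LABELING_MINUTES = {10: (1, 2, 7, 8), 30: (3, 4, 9, 10), 60: (5, 6)}


def lane_conditions(lane):
    """Return (incubation temperature in deg C, labeling minutes) for a lane.

    Odd-numbered lanes were incubated at 37 deg C and even-numbered lanes
    at 45 deg C, per the text.
    """
    if not 1 <= lane <= 10:
        raise ValueError("FIG. 1 is described with lanes 1 to 10")
    temperature_c = 37 if lane % 2 == 1 else 45
    minutes = next(m for m, lanes in LABELING_MINUTES.items() if lane in lanes)
    return temperature_c, minutes


for lane in range(1, 11):
    print(lane, lane_conditions(lane))
```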

The fluorogram derived from SDS-PAGE indicated
that the synthesis of at least three proteins was
increased by increasing the temperature (FIG. 1). The
most prominent induced protein was about 72 kDa (HSP72),
whereas the other two were approximately 80 kDa (HSP80)
and 62 kDa (HSP62). Increased protein synthesis was
already apparent after 10 minutes of labeling (FIG. 1,
lanes 1, 2 and 7, 8) and became more significant when the
labeling period was prolonged to 30 minutes (FIG. 1,
lanes 3, 4 and 9, 10) and 60 minutes (FIG. 1, lanes 5, 6).
The effect of elevated temperature on the protein
synthesis profile of two different S. pneumoniae strains
was similar, with HSPs of similar molecular mass being
synthesized (compare Panel A (type 6 strain 64) to Panel B
(type 4 strain 53) in FIG. 1).



Analysis of the densitometric tracings from
scanning the protein synthesis profiles allowed the
estimation of the relative amounts of proteins. For
example, with respect to heat-shocked S. pneumoniae type 6
strain 64, after 10 minutes of labeling, HSP80 and HSP62
made up 2.9% and 6.8% of the labeled proteins,
respectively, compared to less than 0.1% at 37°C (FIG. 2).
Labeled proteins having an apparent molecular mass of 72
kDa were detected at both 37°C and 45°C conditions
(FIG. 2). Radioimmunoprecipitation analysis revealed,
however, that HSP72 was undetectable at 37°C (supra; and
FIGS. 3, 4 and 6) thus indicating that peak 9 from FIG. 2
corresponds to protein component(s) comigrating with
HSP72. Assuming no variation in the labeling of this
material, these results would suggest that the amount of
HSP72 represents 8.7% of the total labeled cell protein
after heat shock treatment. A comparison of the
densitometric tracings revealed that cellular proteins
corresponding to peaks 4, 10, 13, 17, 19, and 21 were
synthesized at almost the same rate irrespective of heat
shock treatment (FIG. 2). However, the synthesis of
several proteins (peaks 1, 2, 3, 15, 20, 22, 24, and 26)
declined considerably in response to heat shock (FIG. 2).

C. Immune Responses to S. pneumoniae HSPs

In order to assess the antibody response to
pneumococcal HSPs, mouse sera were first assayed by
radioimmunoprecipitation. The repertoire of labeled
proteins recognized by sera from mice immunized with
S. pneumoniae antigen preparations is shown in FIGS. 3
and 4. FIG. 3 relates to detergent soluble protein
preparations. FIG. 4 relates to heat-killed bacterial
preparation. Although many bands were detected by most
antisera, HSP72 was a major precipitation product. The
specificity of antibodies for HSP72 was demonstrated by
the detection of proteins among heat-shocked products only
(FIG. 3, lanes 4, 6, 8 and 10; FIG. 4, lanes 4, 6 and 8).
Interestingly, all immunized mice consistently recognized
HSP72. The antibodies reactive with the HSP72 were not
specific to the strain used during the immunization since
strong reactivities were observed with heterologous
S. pneumoniae HSP72. It should be noted that in addition
to HSP72, one serum precipitated a comigrating product
labeled at both 37°C and 45°C (FIG. 4, lane 4). This 72
kDa-product probably corresponds to the component from
peak 9 in FIG. 2 and was not detected in immunoblots.
HSP62 is another immune target which was precipitated by
some but not all immune sera (FIG. 3, lane 6 and FIG. 4,
lanes 4 and 6). None of the sera tested reacted with
HSP80. No proteins were precipitated when preimmune sera
taken from the mice used in this study were tested for the
presence of antibodies reactive with the labeled products.

As depicted in FIGS. 3 and 5, antibodies to
HSP72 could be detected after one immunization with either
detergent-soluble proteins or whole cell extracts of
S. pneumoniae. In addition, a marked increase in the
antibody response to HSP72 was observed after a second
immunization (FIG. 3, compare lanes 4 and 6, and lanes 8
and 10).

The immunoblot patterns of 15 mice immunized
with heat-killed S. pneumoniae bacteria were remarkably
consistent with the results of the previously described
radioimmunoprecipitation. Although antibody response
variation occurred to a variety of proteins, HSP72 was a
major immunoreactive antigen with 8 (53%) positive sera
after the first immunization (FIG. 5). Antibodies to
HSP72 were detected in 13 out of 15 (87%) immune sera
tested after the second immunization. Two other prominent
antigens having apparent molecular mass of 53.5 and 47 kDa
were detected in 5 (33%) and 7 (47%) sera, respectively
(FIG. 5). The 72 kDa-reactive band was confirmed as the
pneumococcal HSP72 by using recombinant HSP72 antigens
(Example 3, infra) in an immunoblot assay. Preimmune sera
failed to detect any pneumococcal proteins.
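
The percentages quoted for the 15 immunized mice follow directly from the counts of positive sera, as the short Python check below illustrates. It is a sketch only; the dictionary keys and function name are labels introduced here, and the counts are those stated in the text.

```python
MICE = 15  # number of immunized mice tested by immunoblot

# Counts of positive sera recited in the text.
POSITIVE_SERA = {
    "HSP72, first immunization": 8,
    "HSP72, second immunization": 13,
    "53.5 kDa antigen": 5,
    "47 kDa antigen": 7,
}


def percent_positive(positive, total=MICE):
    """Percentage of positive sera, rounded to the nearest whole number."""
    return round(100 * positive / total)


percentages = {name: percent_positive(n) for name, n in POSITIVE_SERA.items()}

for name, pct in percentages.items():
    print(f"{name}: {pct}%")
```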

EXAMPLE 2 - Isolation of Monoclonal Antibodies 
Against Epitopes of HSP72 

A. Procedures 

1. Immunization of Mice And Fusion

Female Balb/c mice (Charles River Laboratories)
were immunized with S. pneumoniae antigens. One set of
mice (fusion experiment 1) were immunized by peritoneal
injection with 10^ formalin-killed whole cell antigen from
strain MTL suspended in Freund's complete adjuvant, and
were boosted at two-week intervals with the same antigen
and then with a sonicate from heat-killed bacteria in
Freund's incomplete adjuvant. A second group of mice
(fusion experiment 2) were immunized three times at
three-week intervals with 75 µg of detergent-soluble
pneumococcal antigens extracted from strain 64 (type 6) in
25 µg of Quil A adjuvant (Cedarlane Laboratories Ltd.,
Hornby, Ontario, Canada). Three days before fusion, all
mice were injected intraperitoneally with the respective
antigen suspended in PBS alone. Hybridomas were produced
by fusion of spleen cells with nonsecreting SP2/0 myeloma
cells as previously described by J. Hamel et al. [J. Med.
Microbiol., 23, pp. 163-170 (1987)]. Specific hybridomas
were cloned by sequential limiting dilutions, expanded and
frozen in liquid nitrogen. The class, subclass, and
light-chain type of MAbs were determined by ELISA as
described by D. Martin et al. [Eur. J. Immunol., 18,
pp. 601-606 (1988)] using reagents obtained from Southern
Biotechnology Associates Inc. (Birmingham, AL).

2. Subcellular Fractionation

Pneumococci were separated into subcellular
fractions according to the technique described by Pearce
et al. [Mol. Microbiol., 9, pp. 1037-1050 (1993)].
Briefly, S. pneumoniae strain 64 (type 6) was grown in
Todd Hewitt broth supplemented with 0.5% (w/v) yeast
extract for 6 hours at 37°C and isolated by centrifugation.
Cell pellets were resuspended in 25 mM Tris-HCl pH 8.0, 1
mM EDTA, 1 mM phenylmethylsulphonyl fluoride (PMSF) and
sonicated for 4 minutes with 15 second bursts. Cellular
debris was removed by centrifugation. The bacterial
membranes and cytoplasmic contents were separated by
centrifugation at 98,000 g for 4 hours. The cytoplasmic
(supernatant) and the membrane (pellet) fractions were
adjusted to 1 mg protein per ml and subjected to SDS-PAGE
and immunoblot analyses.

B. Identification and Characterization
of MAbs to the HSP72 of S. pneumoniae

Culture supernatants of hybridomas were
initially screened by dot enzyme immunoassay using whole
cells from S. pneumoniae strain 65 (type 4) according to
the procedures described in D. Martin et al. (supra).
Positive hybridomas were then retested by immunoblotting
in order to identify the hybridomas secreting MAbs
reactive with the HSP72. Of 26 hybridomas with
anti-S. pneumoniae reactivity in immunoblot, four were
found to recognize epitopes present on a protein band with
an apparent molecular mass of 72 kDa. The four hybridomas
were designated F1-Pn3.1 (from fusion experiment 1) and
F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 (from fusion experiment
2). Isotype analysis revealed that hybridoma F1-Pn3.1
(from fusion experiment 1) secreted IgG2aκ immunoglobulins,
whereas hybridomas F2-Pn3.2, F2-Pn3.3, and F2-Pn3.4 (from
WO 96/40928 PCT/CA96/00322

fusion experiment 2) all secreted IgG1κ. The specificity
of the MAbs for HSP72 was clearly demonstrated by the lack
of radioimmunoprecipitation activity against
[³⁵S]methionine-labeled S. pneumoniae proteins obtained
from cultures incubated at 37°C and the immunoprecipitation
of a 72 kDa-protein with heat shock-derived lysates
incubated at 45°C. FIG. 6 (lanes 5 and 6) demonstrates the
results obtained for MAb F1-Pn3.1. The same results were
obtained with MAbs F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4.

[³⁵S]methionine-labelled lysates from nonheat-shocked
and heat-shocked S. pneumoniae cells probed with
the MAbs were electrophoresed on SDS-PAGE gels and then
subjected to Western blot analysis. The resulting
immunoblots revealed the presence of HSP72 antigen in both
samples. FIG. 7, panel A, shows the results obtained for
MAb F1-Pn3.1. The same results were obtained with MAbs
F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4. Accordingly, the heat
shock stress did not significantly increase the reactivity
of anti-HSP72 monoclonal antibodies. The fluorograph of
the immunoblots, however, clearly showed that the heat
shock response had occurred (FIG. 7, panel B). These
experiments revealed that the rate of synthesis of
S. pneumoniae HSP72 increases in response to heat shock,
but that the absolute amounts of HSP72 do not increase
after heat shock.

C. Cellular localization of HSP72 

In order to investigate the cellular location of
HSP72, S. pneumoniae cell lysates were fractionated by
differential centrifugation resulting in a soluble
fraction and a particulate fraction, enriched in membrane
proteins, supra. Samples containing 15 µg protein of
membrane fraction (lane 1) and cytoplasmic fraction (lane
2) of S. pneumoniae were electrophoresed on SDS-PAGE,
transferred to nitrocellulose and probed with MAb
F1-Pn3.1. In the resulting Western blots, HSP72 was found
in both fractions, with the majority of the protein
associated with the cytoplasmic fraction (FIG. 8).

EXAMPLE 3 - Molecular Cloning, Sequencing
and Expression of Genes Coding
for HSP72 Antigens



A. Procedures

1. Strains and Plasmids

Strains and plasmids used in this study are listed
in Table 1.

TABLE 1: BACTERIAL STRAINS, PHAGES AND PLASMIDS

Strain, Phage,    Relevant Characteristics                      Reference
Plasmid                                                         or Source

E. coli Strains
JM109             Δ(lac-proAB) [F' traD proAB lacIq ZΔM15]      BRL
Y1090             rk- mk- lon supF [pMC9]                       Amersham
BL21(DE3)         lacUV5-T7 RNA polymerase                      Studier et al. (infra)

Phages
λgt11             cI857 S100 cloning vector                     Amersham
λJBD7             LacZ-HSP72 fusion; 2.3 kb EcoRI               This study
                  fragment in λgt11
λJBD17            FucI-HSP72 chimeric; 2.4 kb EcoRI and         This study
                  2.3 kb EcoRI fragments in λgt11

Plasmids
pWSK29            Amp^r; low copy number cloning vector         Wang et al. (infra)
pWKS30            same as pWSK29 but opposite multi             Wang et al. (infra)
                  cloning site
pJBD171           same as λJBD17 but in pWSK29                  This study
pJBD177           2.8 kb XhoI-EcoRI fragment in pWKS30;         This study
                  no recombinant HSP72 protein expressed
pJBD179           FucI-HSP72 fusion; 2.4 kb EcoRI and           This study
                  0.8 kb EcoRI-EcoRV fragments in pWSK29
pT7-5             Amp^r; T7 promoter Φ10                        Tabor et al. (infra)
pT7-6             same as pT7-5 but opposite multi              Tabor et al. (infra)
                  cloning site
pJBDf51           same as pJBD179 but in pT7-5                  This study
pJBDf62           same as pJBD179 but in pT7-6                  This study
pDELTA1           Amp^r; Tn1000                                 BRL
pJBDΔ1            same as pJBD179 but in pDELTA1                This study
pJBD291           HSP72; 3.2 kb HindIII fragment in pWSK29      This study
pJBDk51           same as pJBD291 but in pT7-5                  This study
pJBDΔ4            same as pJBD291 but in pDELTA1                This study

E. coli strains were grown in L broth or on L
agar at 37°C. When necessary, ampicillin was added to the
media at the concentration of 50 µg/ml. Plasmids were
isolated by using the Magic/Wizard® Mini-Preps kit
(Promega, Fisher Scientific, Ottawa, Canada).



2. General Recombinant DNA Techniques

Restriction endonucleases, T4 DNA ligase, and
DNA molecular weight standards were purchased from
Boehringer Mannheim Canada, Laval, Quebec or Pharmacia
Biotech, Uppsala, Sweden. DNA restriction endonuclease
digestion and ligation were performed as described by
J. Sambrook et al. [Molecular cloning. A laboratory
manual, Cold Spring Harbor Laboratory Press, N.Y.
(1989)]. Agarose gel electrophoresis of DNA fragments was
performed following the procedure of J. Sambrook et al.
(supra) using the TAE buffer (0.04 M Tris-acetate; 0.002 M
EDTA) from Boehringer Mannheim. DNA fragments were
purified from agarose gel by using the Prep-A-Gene® DNA
purification kit (Bio-Rad Laboratories Ltd., Mississauga,
Ontario). Transformation was carried out by
electroporation with the Gene Pulser® (Bio-Rad) following
the protocol provided by the manufacturer.



3. Construction and Screening
of Genomic Library

A genomic S. pneumoniae DNA library was
generated in the bacteriophage expression vector λgt11
(λgt11 cloning system, Amersham) according to the
procedure provided by the manufacturer. Chromosomal DNA
of S. pneumoniae type 6 strain 64 was prepared by
following the procedure of J.C. Paton et al. [Infect.
Immun., 54, pp. 50-55 (1986)]. The S. pneumoniae
chromosomal DNA was partially digested with EcoRI, and the
4- to 7-kb fragments were fractionated and purified from
agarose gel. The fragments were ligated into λgt11 arms,
packaged, and the resulting phage mixtures used to infect
E. coli Y1090. Immunoscreening of plaques expressing
recombinant HSP72 antigens was performed using
HSP72-specific monoclonal antibody F1-Pn3.1, supra. Plaque
clones expressing peptides recognized by MAb F1-Pn3.1 were
isolated and purified. Liquid lysates were prepared and
DNA was purified from a Promega LambdaSorb phage adsorbent
according to the manufacturer's directions followed by
conventional DNA purification procedures.

4. Southern Blot Analysis

The nonradioactive DIG DNA Labelling and
Detection kit, obtained from Boehringer Mannheim, was used
to perform Southern blot analysis in this example. The
DNA fragments selected for use as probes (infra) were
purified by agarose gel electrophoresis and then labelled
with digoxigenin (DIG)-11-dUTP. Pneumococcal chromosomal
DNA was digested with HindIII and the digests were
separated by electrophoresis on an 0.8% SDS-PAGE gel and
transferred onto positively charged nylon membranes
(Boehringer Mannheim) as described by J. Sambrook et al.
(supra). The membrane was then blotted with the
DIG-labelled DNA probes according to the protocol of the
manufacturer.



5. DNA Sequencing and Sequence Analysis

The DNA fragments sequenced in this example were
first cloned into plasmid pDELTA 1 (GIBCO BRL Life
Technologies, Burlington, Ontario). A series of nested
deletions were generated from both strands by in vivo
deletion mediated by Tn1000 transposon transposition
(Deletion Factory System, GIBCO BRL) following the
procedures provided by the supplier. These deletions were
sized by agarose gel electrophoresis and appropriate
deletion derivatives were selected for sequencing by the
dideoxynucleotide chain terminating method of Sanger
et al. [Proc. Natl. Acad. Sci. USA, 74, pp. 5463-5467
(1977)]. To sequence the gaps between deletion templates,
oligonucleotides were synthesized by oligonucleotide
synthesizer 392 (ABI, Applied Biosystems Inc., Foster
City, CA). The sequencing reaction was carried out by PCR
(DNA Thermal Cycler 480®, Perkin Elmer) using the Taq
DyeDeoxy Terminator Cycle Sequencing kit (ABI), and DNA
electrophoresis was performed on automated DNA sequencer
373A (ABI).

6. Expression of Cloned Gene in
E. coli T7 RNA pol/promoter system

High level expression of the cloned gene in this
example was achieved by employing the bacteriophage T7 RNA
polymerase/promoter system in E. coli. The DNA fragment
specifying the recombinant protein was ligated into
plasmids pT7-5 or pT7-6 [S. Tabor and C.C. Richardson,
Proc. Natl. Acad. Sci. USA, 82, pp. 1074-1078 (1985)], in
a proper orientation in which the gene to be expressed was
placed under the control of phage T7 RNA polymerase
specific promoter Φ10. The resulting plasmid was
transformed into E. coli strain BL21(DE3) [F.W. Studier
and B.A. Moffatt, J. Mol. Biol., 189, pp. 113-130 (1986)]
which carries the T7 RNA polymerase structural gene on its
chromosome under the control of the inducible lacUV5
promoter. Upon IPTG induction, the T7 RNA polymerase
induced in the BL21(DE3) transformants specifically
transcribed the gene under the control of T7 promoter Φ10.
The overexpressed recombinant proteins were visualized by
either Western blotting or Coomassie Blue staining.

7. N-terminal Amino Acid Sequence
Analysis of HSP72

Pneumococcal HSP72 was purified by
immunoprecipitation using MAb F1-Pn3.1 (supra) and samples
of cell wall extracts of S. pneumoniae strain 64 prepared
as described by L.S. Daniels et al. [Microb. Pathogen., 1,
pp. 519-531 (1986)] as antigen. The immune precipitates
were resolved by SDS-PAGE and then transferred to
polyvinylidene difluoride (PVDF) membrane by the method of
P. Matsudaira [J. Biol. Chem., 262, pp. 10035-10038
(1987)]. PVDF membrane was stained with Coomassie Blue,
the HSP72 band excised and then analyzed in an automated
protein sequencer (ABI), according to standard procedures.

B. Construction of Plasmids Containing
S. pneumoniae HSP72 Gene Fragments
Corresponding to C-169

The λgt11 S. pneumoniae genomic DNA library was
screened with the HSP72-specific MAb F1-Pn3.1. Seventeen
(17) immunoreactive clones were isolated and purified from
a total of 1500 phages tested. To confirm the specificity
of the proteins expressed by the recombinant phages,
Western blot analysis of the recombinant phage lysates was
performed. Two groups of clones were identified among the
17 positive clones recognized by MAb F1-Pn3.1 and their
representatives were designated as λJBD7 and λJBD17 for
further characterization. As shown in FIG. 9, whole cell
extracts from S. pneumoniae strain 64 (lane 1) and phage
lysates from E. coli infected with λJBD17 (lanes 2 and 3)
or λJBD7 (lanes 4 and 5) cultured in the presence (+) or
absence (-) of IPTG were subjected to 10% polyacrylamide
gel electrophoresis and were electrotransferred to
nitrocellulose. The immunoblot was probed with
HSP72-specific MAb F1-Pn3.1. Clone λJBD17 had two
EcoRI-EcoRI insert fragments of 2.4 kb and 2.3 kb
(FIG. 10), and expressed a chimeric recombinant protein
having an apparent molecular mass of 74 kDa on SDS-PAGE gel
(FIG. 9, lanes 2 and 3). Clone λJBD7 was found to contain
a 2.3 kb EcoRI insert fragment and produced an apparent
fusion protein consisting of LacZ and the 74 kDa chimeric
protein expressed from clone λJBD17. The fusion protein
had an apparent molecular mass of 160 kDa as estimated by
SDS-PAGE (FIG. 9, lane 5). The expression of the chimeric
recombinant protein encoded by phage λJBD17 was
independent of IPTG induction (FIG. 9, lanes 2 and 3)
while the expression of the recombinant fusion protein
encoded by phage λJBD7 was dependent on induction of the
lac promoter (FIG. 9, lanes 4 and 5).

In an attempt to subclone the HSP72 gene, the
pneumococcal DNA insert from clone λJBD17 was extracted,
purified and ligated into a low copy plasmid pWSK29 [R.F.
Wang and S.R. Kushner, Gene, 100, pp. 195-199 (1991)] to
generate plasmid pJBD171. The insert from pJBD171 was
characterized by restriction mapping (FIG. 10B), and a
series of subcloning and immunoblotting was carried out to
define the boundaries of the gene coding for the antigen
reactive with MAb F1-Pn3.1. The region responsible for
expression of the 74 kDa chimeric protein was found to
localize on the 3.2 kb EcoRI-EcoRV fragment, which
consists of the intact 2.4 kb EcoRI-EcoRI fragment and the
0.8 kb EcoRI-EcoRV portion of the 2.3 kb EcoRI-EcoRI
fragment. The plasmid carrying the 3.2 kb EcoRI-EcoRV
insert was designated pJBD179.

C. Expression and DNA Sequence
Analysis of a Chimeric Gene
Coding for C-169

To further determine the transcriptional
direction of the gene coding for the 74 kDa chimeric
protein on the 3.2 kb EcoRI-EcoRV fragment, and to
increase the yield of the 74 kDa chimeric protein for
immunological study, we decided to express the 74 kDa
chimeric protein in the E. coli T7 RNA polymerase and T7
promoter system. The 3.2 kb EcoRI-EcoRV fragment, derived
from pJBD179, was ligated into plasmids pT7-5 and pT7-6 in
which the multi-cloning sites were placed in opposite
orientation with respect to the T7 RNA polymerase specific
T7 promoter Φ10. The ligation mixture was used to
transform E. coli JM109 and positive transformants
reactive with MAb F1-Pn3.1 were identified by the colony
lifting method described by J. Sambrook et al. [supra].
The resulting recombinant plasmids, derived from pT7-5 and
pT7-6, were designated pJBDf51 and pJBDf62, respectively.
The intact 3.2 kb EcoRI-EcoRV insert in these recombinant
plasmids and its orientation were determined by
restriction mapping. To achieve overexpression of the
74 kDa chimeric protein, pJBDf51 and pJBDf62 were
transformed, separately, into E. coli BL21(DE3). The
transformants were induced with IPTG (1 mM) for 3 hours at
37°C. The cells were harvested, washed, resuspended in
1% SDS and boiled for 10 minutes. The lysates were then
used for SDS-PAGE and immunoblot analysis. As expected,
both transformants produced the 74 kDa chimeric protein
readily detected by Western blotting with MAb F1-Pn3.1
(FIG. 11). However, under the IPTG induction condition,
only transformants BL21(DE3)(pJBDf51) overexpressed the 74
kDa chimeric protein (FIG. 11A and B, lane 2) indicating
that the transcriptional direction of the gene on the 3.2
kb EcoRI-EcoRV fragment is from the EcoRI end towards the
EcoRV end (FIG. 10A).

The 3.2 kb EcoRI-EcoRV fragment was cloned into
plasmid pDELTA 1 to yield plasmid pJBDΔ1. A series of
overlapping deletions was generated and used as DNA
sequencing templates. The DNA sequence of the entire 3.2
kb EcoRI-EcoRV insert is SEQ ID NO:1. Two open reading
frames ("ORFs") were found and their orientation is
indicated in FIG. 10B ("ORF27" and "FucI-HSP72 (C-169)").
In front of these two ORFs, putative ribosome-binding
sites were identified (SEQ ID NO:1, nucleotides 18-21 and
760-763). No obvious -10 and -35 promoter sequences were
detected. ORF27 spans nucleotides 30-755 (SEQ ID NO:1) and
encodes a protein of 242 amino acids with a calculated
molecular weight of 27,066 daltons. The deduced amino
acid sequence of this protein is SEQ ID NO:2. We
designated this gene orf27, and compared it to other known
sequences. No homologous gene or protein was found. The
large ORF (nucleotides 771-2912, SEQ ID NO:1) specifies a
protein of 714 amino acids with a predicted molecular mass
of 79,238 daltons. The deduced amino acid sequence of
this protein is SEQ ID NO:3. This ORF was compared with
other known sequences to determine its relationship to
other amino acid sequences. This analysis revealed a high
degree of similarity of the encoded protein to the
sequence of E. coli fucose isomerase (FucI) and to several
HSP70 gene family members, also known as DnaK genes.
Alignment of SEQ ID NO:3 with the E. coli FucI and
HSP70 (DnaK) proteins indicated that the N-terminal
portion corresponding to amino acids 1 to 545 (SEQ ID
NO:3) of the 74 kDa chimeric protein is highly homologous
to E. coli FucI, while the C-terminal portion
corresponding to amino acids 546-714 (SEQ ID NO:3) is
similar to HSP70 (DnaK) proteins. It is noteworthy that
there is an EcoRI restriction site lying at the junction
of these two portions of the gene coding for the 74 kDa
protein (SEQ ID NO:1, between nucleotides 2404 and 2405).
Other restriction sites exist between nucleotides 971 and
972 (PstI), nucleotides 1916 and 1917 (PstI),
nucleotides 1978 and 1979 (XhoI), and nucleotides 3164
and 3165 (EcoRV). From these data we concluded that the
74 kDa protein was a chimeric protein encoded by two
pieces of S. pneumoniae chromosomal DNA, a 2.4 kb
EcoRI-EcoRI fragment derived from the FucI homologous gene
and a 2.3 kb EcoRI-EcoRI fragment derived from the HSP72
gene.
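
The ORF coordinates reported for SEQ ID NO:1 can be
cross-checked with simple arithmetic. The Python sketch below
is illustrative only: it assumes that each stated span is
inclusive and does not include the stop codon (the text does
not say so explicitly), and the helper name codon_count is
hypothetical.

```python
def codon_count(start: int, end: int) -> int:
    """Number of codons in an ORF given inclusive, 1-based nucleotide coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}-{end} ({length} nt) is not a multiple of 3")
    return length // 3


# Coordinates as stated in the text for SEQ ID NO:1.
orf27_codons = codon_count(30, 755)      # ORF27
fusion_codons = codon_count(771, 2912)   # FucI-HSP72 (C-169) chimeric ORF

# Residues 546-714 of the chimeric protein form the HSP72-derived portion.
c_terminal_residues = 714 - 546 + 1

print(orf27_codons, fusion_codons, c_terminal_residues)  # 242 714 169
```

Under these assumptions the spans give 242 and 714 codons,
matching the stated protein lengths, and the HSP72-derived
portion is 169 residues long, consistent with the name C-169.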

D. Southern Blot Analysis

Southern blotting was performed in order to
confirm that the 74 kDa protein is a chimeric protein and
to attempt to clone the entire pneumococcal HSP72 gene.
Chromosomal S. pneumoniae DNA was digested with HindIII to
completion, separated on a 0.8% agarose gel, and
transferred onto two positively charged nylon membranes
(Boehringer Mannheim). The membranes were then blotted
with either the 0.8 kb EcoRI-EcoRV probe, derived from the
2.3 kb EcoRI-EcoRI fragment, or the 1 kb PstI-PstI probe,
obtained from the 2.4 kb EcoRI-EcoRI fragment. Both
probes had been previously labelled with digoxigenin-dUTP.
These two probes hybridized to two individual HindIII
fragments of different sizes (FIGS. 10B and 10C). The 0.8
kb EcoRI-EcoRV probe recognized the 3.2 kb HindIII
fragment and the 1 kb PstI-PstI probe reacted with the 4
kb HindIII fragment. This result further indicated that
the gene responsible for the expression of the 74 kDa
chimeric protein was generated by fusion, in frame, of two
EcoRI fragments, one originating from the
fragment containing the 5' portion of the S. pneumoniae
FucI homologue, the other derived from the segment
carrying the C-169 fragment of the pneumococcal HSP72
gene. The fact that the 0.8 kb EcoRI-EcoRV probe
hybridized to a single 3.2 kb fragment suggested that there
is only a single HSP72 gene copy in S. pneumoniae.

E. Production of Recombinant HSP72

A partial pneumococcal genomic library was
generated by ligation of the pool of HindIII digests of
chromosomal DNA, with sizes ranging from 2.8 to 3.7 kb,
into plasmid pWSK29/HindIII. The ligation mixture was
used to transform E. coli strain JM109 and the
transformants were screened by hybridization with the 0.8
kb EcoRI-EcoRV probe. One representative plasmid from
four positive hybridizing clones was named pJBD291.
Restriction analysis of the insert and Western blot of the
cell lysate of transformants were employed to verify that
the plasmid pJBD291 indeed carries the 3.2 kb HindIII
fragment containing the HSP72 gene expressing the
recombinant HSP72 protein (FIG. 10B). The HSP72 protein
expressed by the transformants (pJBD291) migrated on the
SDS-PAGE gel at the same position as the native HSP72
protein (FIG. 12). To sequence the entire HSP72 gene and
to overexpress the full-length HSP72 protein, the 3.2 kb
HindIII fragment was isolated from plasmid pJBD291, and
subcloned into plasmids pDELTA 1 and pT7-5 to generate
pJBDΔ4 and pJBDk51, respectively.

The entire 3.2 kb HindIII DNA fragment carried
on the plasmid pJBDΔ4 and the 2.3 kb EcoRI-EcoRI DNA
fragment contained on the plasmid pJBD177 were sequenced.
Altogether, the nucleotide sequence comprised 4320 base
pairs and revealed two ORFs (SEQ ID NO:4). The first ORF,
starting at nucleotide 682 and ending at nucleotide 2502
(SEQ ID NO:4), was identified as the pneumococcal HSP72
gene, and the second ORF, spanning from nucleotide 3265 to
nucleotide 4320 (SEQ ID NO:4), was located 764 base pairs
downstream from the HSP72 structural gene and was
identified as the 5' portion of the pneumococcal DnaJ
gene. The putative ribosome binding site ("AGGA") was
located 9 base pairs upstream from the start codon of the
HSP72 structural gene, while the typical ribosome binding
site ("AGGA") was found 66 base pairs upstream from the
start codon of the DnaJ structural gene. No typical 5'
regulatory region was identified in front of these two
genes. Restriction sites are located between nucleotides
1 and 2 (HindIII), nucleotides 1318 and 1319 (EcoRI),
nucleotides 1994 and 1995 (EcoRI), nucleotides 3343 and
3344 (HindIII), and nucleotides 4315 and 4316 (EcoRI).
The gene organization of HSP72 (DnaK) and DnaJ in
S. pneumoniae is similar to that of E. coli [Saito, H. and
Uchida, Mol. Gen. Genet. 164, 1-8 (1978)] as well as
several other Gram positive bacteria [Wetzstein, M.
et al., J. Bacteriol. 174, 3300-3310 (1992)]. However,
the intergenic region of S. pneumoniae is significantly
larger and no ORF for the grpE gene was found upstream of
the HSP72 (DnaK) structural gene.
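
The restriction site positions listed for SEQ ID NO:4 can be
related to the fragment sizes quoted elsewhere in this
Example. The following Python sketch is a rough check, not
part of the original procedure: it assumes that "between
nucleotides n and n+1" denotes the cut position, and the
function name fragment_sizes is hypothetical.

```python
def fragment_sizes(cut_after: list[int]) -> list[int]:
    """Sizes (bp) of the fragments lying between consecutive cut positions.

    Each entry is the last nucleotide before a cut, so a site described as
    "between nucleotides 1318 and 1319" is entered as 1318.
    """
    cuts = sorted(cut_after)
    return [b - a for a, b in zip(cuts, cuts[1:])]


# Cut positions as stated in the text for SEQ ID NO:4.
hindiii_cuts = [1, 3343]
ecori_cuts = [1318, 1994, 4315]

print(fragment_sizes(hindiii_cuts))  # [3342]
print(fragment_sizes(ecori_cuts))    # [676, 2321]
```

Under this reading, the 2321 bp EcoRI-EcoRI interval
corresponds to the fragment described as 2.3 kb, and the
3342 bp HindIII interval to the fragment described as 3.2 kb;
sizes estimated from gels are approximate.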

The predicted HSP72 protein has 607 amino acids
and a calculated molecular mass of 64,755 daltons, as
compared to the 72 kDa molecular mass estimated by
SDS-PAGE. The predicted HSP72 protein is acidic with an
isoelectric point (pI) of 4.35. Automated Edman
degradation of the purified native HSP72 protein extracted
from S. pneumoniae strain 64 revealed SKIIGIDLGTTN-AVAVLE
as the 19 amino acid N-terminal sequence of the protein.
The amino-terminal methionine was not detected, presumably
due to in situ processing, which is known to occur in many
proteins. No amino acid residue was identified at
position 13. The 19 amino acid N-terminal sequence
obtained from the native HSP72 protein is in full
agreement with the 19 amino acid N-terminal sequence
deduced from the nucleotide sequence of the recombinant
S. pneumoniae HSP72 gene (SEQ ID NO:5), thus confirming the
cloning. This N-terminal sequence showed complete
identity with the DnaK protein from Lactococcus lactis and
68.4% identity with the DnaK protein from Escherichia
coli. Similarly, the alignment of the predicted amino
acid sequence of HSP72 (SEQ ID NO:5) with those from other
bacterial HSP70 (DnaK) proteins also revealed high
homology (FIGS. 13A-13D). For example, HSP72 showed 54%
identity with the E. coli DnaK protein. The highest
identity value was obtained from comparison with the Gram
positive bacterium Lactococcus lactis, showing 85%
identity with HSP72. Like other HSP70 proteins of Gram
positive bacteria, HSP72 lacks a stretch of 24 amino
acids near the amino terminus when compared with DnaK
proteins from Gram negative bacteria (FIGS. 13A-13D).

Although HSP72 shares homology with HSP70 (DnaK)
proteins from other organisms, it does possess some unique
features. Sequence divergence of the HSP70 (DnaK)
proteins is largely localized to two regions (residues 244
to 330 and 510 to 607, SEQ ID NO:5). More specifically,
the peptide sequences GFDAERDAAQAALDD (residues 527 to
541, SEQ ID NO:5) and AEGAQATGNAGDDW (residues 586 to
600, SEQ ID NO:5) are exclusive to HSP72. The fact that
the C-terminal portion of HSP72 is highly variable
suggests that this portion carries antigenic determinants
specific to S. pneumoniae. Consistent with this
hypothesis, monoclonal antibodies directed against the
C-169 fragment of HSP72 (infra) were not reactive with
E. coli and S. aureus, which are known to express DnaK
proteins similar to HSP72.

The truncated DnaJ protein of S. pneumoniae (SEQ
ID NO:6) has 352 amino acids, which show a high degree of
similarity with the corresponding portions of the L.
lactis DnaJ protein (72% identity) and the E. coli DnaJ
protein (51% identity). The predicted truncated DnaJ
protein has a high glycine content (15%). Four Gly-,
Cys-rich repeats, each with the Cys-X-X-Cys-X-Gly-X-Gly
motif characteristic of DnaJ proteins [P. A. Silver and
J.C. Way, Cell, 74, pp. 5-6 (1993)], were identified
between amino acids 148 and 212 of the S. pneumoniae DnaJ
protein (SEQ ID NO:6). Three repeated GGFGG sequences
(residues 75-79, 81-85, and 90-94) were found near the
amino terminus.

F. Reactivity of MAbs Against
Recombinant Antigens



The four HSP72-specific MAbs (F1-Pn3.1,
F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4, supra) were tested for their
reactivity against proteins expressed by E. coli infected
or transformed with recombinant phages and plasmids
containing HSP72 sequences. The four individual MAbs
reacted with the lacZ-HSP72 fusion protein expressed by
the clone λJBD7, thus localizing the epitopes recognized
by these MAbs to the C-terminal 169 residues.
Surprisingly, the proteins encoded by the pneumococcal
inserts in λJBD17 and pJBDΔ1 were recognized by only 3 of
the 4 MAbs. These results suggest that although the C-169
fragments synthesized in E. coli infected with λJBD7 and
λJBD17 have the same primary structure, they have distinct
conformations. The lack of reactivity of MAb F2-Pn3.2 with
some recombinant proteins raised the possibility that this
particular MAb recognizes a more complex epitope.
Although complex, F2-Pn3.2 epitopes are still recognizable
on Western immunoblots. The complete HSP72rec protein
expressed by E. coli containing the recombinant plasmid
pJBDΔ4 was reactive with all four MAbs.

EXAMPLE 4 - Antigenic Specificity and
Reactivity of HSP72-Specific
Monoclonal Antibodies

The reactivity of MAbs F1-Pn3.1, F2-Pn3.2,
F2-Pn3.3 and F2-Pn3.4 to a collection of bacterial strains
including 20 S. pneumoniae strains representing 16
capsular serotypes (types 1, 2, 3, 4, 5, 6, 8, 9, 10, 11,
12, 14, 15, 19, 20, and 22) and the 17 non-pneumococcal
bacterial strains listed in Table 2 was tested using a
dot enzyme immunoassay as described by D. Martin et al.
[supra] and immunoblotting. For dot enzyme immunoassay,
the bacteria were grown overnight on chocolate agar plates
and then suspended in PBS, pH 7.4. A volume of 5 µl of a
suspension containing approximately 10^ CFU/ml was applied
to a nitrocellulose paper, blocked with PBS containing 3%
bovine serum albumin, and then incubated sequentially with
MAbs and peroxidase-labeled secondary antibody. Whole
cell extracts were prepared for Western blot analysis by
boiling bacterial suspensions in sample buffer for 5
minutes.



TABLE 2: LIST OF NON-PNEUMOCOCCAL ISOLATES
TESTED BY DOT ENZYME IMMUNOASSAY

Strain
Designation   Genus species               Group or type

C-2           Streptococcus pyogenes      group A
C-3           Streptococcus agalactiae    group B
C-7           Enterococcus faecalis       group D
C-9           Streptococcus bovis         group D
C-14          Streptococcus mutans
C-15          Streptococcus salivarius
C-19          Streptococcus sanguis       I
C-20          Streptococcus sanguis       I
C-21          Streptococcus sanguis       I
C-22          Streptococcus sanguis       II
C-23          Streptococcus sanguis       II
C-24          Streptococcus sanguis       II
C-25          Streptococcus sanguis       II
C-27          Gemella morbillorum
C-30          Staphylococcus aureus
C-33          Bacillus
C-36          Escherichia coli

When tested by dot enzyme immunoassay, each MAb
reacted with each of the S. pneumoniae strains and none of
the non-pneumococcal isolates. These results were
unexpected since comparison studies revealed that HSP72 is
very similar to other known bacterial HSP70 (DnaK)
proteins, for example those from E. coli and S. aureus.

Immunoblots were then performed to further
investigate the immunoreactivities of our MAbs. As shown
in Table 3, each MAb exhibited some reactivity. Although
the percent identity of the E. coli amino acid sequence
and the HSP72 amino acid sequence (SEQ ID NO:5) is 54%,
the four HSP72-specific MAbs did not recognize the E. coli
HSP70 (DnaK) protein. Similarly, the HSP72-specific MAbs
did not react with the C. trachomatis HSP70 (DnaK)
protein, which has 56% amino acid identity with the amino
acid sequence of HSP72. High amino acid sequence homology
is observed between HSP72 and the HSP70 (DnaK) proteins
from Gram positive bacterial species. However, again,
none of the HSP72-specific MAbs reacted with the S. aureus
or Bacillus Gram positive species, which exhibit 74% and 76%
amino acid sequence homology, respectively, with HSP72.
From these data it is clear that although HSP70 (DnaK)
proteins may be structurally related to HSP72, they are
immunologically distinct. The non-pneumococcal
isolates that reacted with at least one MAb are S.
pyogenes, Enterococcus faecalis, S. mutans and S. sanguis,
which all belong to the genus Streptococcus or to the
Streptococcus-related genus Enterococcus. So far, neither
the HSP70 protein nor the gene structure has been
identified in these Streptococcus or Enterococcus species.
Altogether, these observations indicate that hypervariable
amino acid sequences or residues within HSP70 (DnaK)
proteins are involved in antigenicity. Interestingly,
immunoblotting analysis revealed that there was no
significant variation in the molecular mass of the HSP70
(DnaK) proteins among S. pneumoniae isolates and
immunoreactive non-pneumococcal isolates.

wo 96/40928 PCT/CA96/00322 



TABLE 3: REACTIVITY OF MAbs WITH NON-PNEUMOCOCCAL ISOLATES IN
WESTERN IMMUNOBLOTTING

Bacterial Strain                                      MAbs

Designation  Genus/species              Type     F1-Pn3.1  F2-Pn3.2  F2-Pn3.3  F2-Pn3.4

C-2          Streptococcus pyogenes     group A  -                   -
C-3          Streptococcus agalactiae   group B  -         -         -         -
C-7          Enterococcus faecalis      group D  -         +         -         -
C-9          Streptococcus bovis        group D  -         -         -         -
C-14         Streptococcus mutans                -         +         -         ±
C-15         Streptococcus salivarius            -         -         -         -
C-19         Streptococcus sanguis      I        +
C-20         Streptococcus sanguis      I        +         +         -         +
C-21         Streptococcus sanguis      I        +         +         +         +
C-22         Streptococcus sanguis      II                           +
C-23         Streptococcus sanguis      II
C-24         Streptococcus sanguis      II       +         +         +         +
C-25         Streptococcus sanguis      II       +         +         +         +
C-27         Gemella morbillorum
C-30         Staphylococcus aureus
C-33         Bacillus
C-36         Escherichia coli
C-RP         Chlamydia trachomatis**    L2

* ± indicates a weak signal compared to the reactivity
observed with S. pneumoniae antigens.

** C. trachomatis purified elementary bodies were tested.

EXAMPLE 5 - Purification of HSP72 And Its
Use As An Immunogen to Protect
Against Lethal S. pneumoniae Infection

A. Procedures

1. Preparation of Purified
Recombinant HSP72 Protein
and Recombinant C-169
High level exclusive expression of the HSP72
gene was achieved by employing the bacteriophage T7 RNA
polymerase/T7 promoter system in E. coli. The 3.2 kb
HindIII fragment was cloned in both orientations in front
of the T7 promoter Φ10 in the plasmid pT7-5. The
resulting plasmid pJBDk51 was then transformed into
E. coli strain BL21(DE3). Overexpression of the
recombinant HSP72 protein (HSP72rec) was induced by
culturing in broth supplemented with antibiotics for a
3-hour period after the addition of IPTG to a final
concentration of 1 mM. E. coli cells expressing high levels
of HSP72rec were concentrated by centrifugation and lysed by
mild sonication in lysis buffer (50 mM Tris-Cl (pH 8.0),
1 mM EDTA and 100 mM NaCl) containing 0.2 mg/ml lysozyme.
The cell lysates were centrifuged at 12,000 g for 15
minutes and the supernatants were collected. HSP72rec was
purified by immunoaffinity using monoclonal antibody
F1-Pn3.1 immobilized on Sepharose 4B beads (Pharmacia). The
purity of eluates was assessed by SDS-PAGE.

The recombinant C-169 protein (C-169rec) was
expressed in the form of insoluble inclusion bodies in
E. coli strain JM109 transformed with the plasmid pJBDΔ1.
Protein inclusion bodies were recovered from pelleted
bacterial cells disrupted by sonication as described
before. The pellets were washed in lysis buffer
containing 1 mg/ml of deoxycholate to remove contaminating
materials, and the protein inclusion bodies were then
solubilized in 6 M urea. The protein solution was
centrifuged at 100,000 g and the cleared supernatant was
collected and dialysed against phosphate-buffered saline.
After purification, the protein content was determined by
the Bio-Rad protein assay (Bio-Rad Laboratories,
Mississauga, Ontario, Canada).

2. Active Immunoprotection Studies

Two groups of 10 female Balb/c mice (Charles
River Laboratories) were immunized subcutaneously three
times at two-week intervals with 0.1 ml of purified
HSP72rec or C-169rec antigens adsorbed to Alhydrogel
adjuvant. Two antigen doses, approximately 1 and 5 µg,
were tested. A third group of 10 control mice was
immunized identically via the same route with Alhydrogel
adjuvant alone. Blood samples were collected from the
orbital sinus prior to each immunization and five to
seven days following the third injection. The mice were
then challenged with approximately 10^6 CFU of the type 3
S. pneumoniae strain WU2. Samples of the S. pneumoniae
challenge inoculum were plated on chocolate agar plates to
determine the CFU and to verify the challenge dose.
Deaths were recorded at 6-hour intervals for the first 3-4
days post-infection and then at 24-hour intervals for a
period of 14 days. On days 14 or 15, the surviving mice
were sacrificed and blood samples tested for the presence
of S. pneumoniae organisms. Antibody responses to the
recombinant HSP72 antigens are described in Example 7.

3. Passive Immunoprotection Studies

One NZW rabbit (Charles River Laboratories) was
immunized subcutaneously at multiple sites with
approximately 50 µg of the purified C-169rec protein
adsorbed to Alhydrogel adjuvant. The rabbit was boosted
three times at two-week intervals with the same antigen
and blood samples were collected 7 and 14 days following the
last immunization. The serum samples were pooled and
antibodies were purified by precipitation using 40%
saturated ammonium sulfate.

Severe combined immunodeficient (SCID) mice were
injected intraperitoneally with 0.25 ml of the purified
rabbit antibodies 1 hour before intravenous challenge with
5000 or 880 CFU of the type 3 S. pneumoniae strain WU2.
Control SCID mice received sterile buffer or antibodies
purified from nonimmune rabbit sera. Samples of the
S. pneumoniae challenge inoculum were plated on chocolate
agar plates to determine the CFU and to verify the
challenge dose. The SCID mice were chosen because of
their high susceptibility to S. pneumoniae infection.
Blood samples (20 µl each) obtained 24 hours
post-challenge were plated on chocolate agar and tested for
the presence of S. pneumoniae organisms. The level of
detection was 50 CFU/ml. Deaths were recorded at 24-hour
intervals for a period of 5 days.
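
The stated detection level follows from the plated volume. The
Python sketch below works this out under two assumptions that
are not spelled out in the text: the whole 20 µl blood sample
is plated undiluted, and a single colony is the smallest
countable result. The function name is hypothetical.

```python
def detection_limit_cfu_per_ml(plated_volume_ul: float, min_colonies: int = 1) -> float:
    """Lowest concentration (CFU/ml) that yields min_colonies on the plate."""
    plated_volume_ml = plated_volume_ul / 1000.0
    return min_colonies / plated_volume_ml


limit = detection_limit_cfu_per_ml(20)
print(round(limit))  # 50
```

One colony from 20 µl corresponds to 50 CFU/ml, which agrees
with the level of detection given above.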

B. Results

The availability of cloned S. pneumoniae DNA
inserts encoding the complete or partial (C-169) HSP72
protein and the expression of recombinant proteins in
E. coli made it possible to obtain purified proteins useful
for the investigation of the vaccinogenic potential of
HSP72 protein. Both HSP72rec and C-169rec proteins were
obtained in a relatively pure state with no contaminants
detected on Coomassie Blue-stained SDS polyacrylamide gels
(FIGS. 14 and 15, respectively).

To evaluate the vaccinogenic potential of HSP72,
we first examined the ability of HSP72rec to elicit a
protective immune response. Groups of 10 mice were
immunized with full-length HSP72rec (1 µg or 5 µg dose) and
challenged with 4.2 million CFU of S. pneumoniae type 3
strain WU2. Eighty percent (80%) of the mice dosed with 1
µg HSP72rec survived the challenge, as did 50% of the mice
dosed with 5 µg HSP72rec. None of the naive mice immunized
with Alhydrogel adjuvant alone without antigen survived
the challenge (FIG. 16). No S. pneumoniae organisms were
detected in any of the blood samples collected on days 14
or 15 from mice surviving infection. The observation that
HSP72rec elicited protection against type 3 strain WU2
pneumococci indicated that HSP72 derived from DNA
extracted from a type 6 strain contains epitopes capable
of eliciting protection against a heterologous strain
having a different capsular type.



We further examined the immune response to the
HSP72 protein by using recombinant protein fragments
expressed from E. coli transformed with a chimeric
fucI-HSP72 gene. Mice immunized with purified C-169rec were
protected from fatal pneumococcal challenge, thus
demonstrating that some, if not all, epitopes eliciting
protection are present in the C-terminal region of the
HSP72 molecule comprising the last 169 residues. Groups
of 10 mice were immunized with C-169rec (1 µg or 5 µg
doses) and challenged with 6 million CFU of S. pneumoniae
type 3 strain WU2. Sixty percent (60%) of the mice dosed
with 1 µg C-169rec survived the challenge, as did 70% of
the mice dosed with 5 µg C-169rec (FIG. 17). In contrast,
all of the naive mice were dead by 2 days post-challenge.
Therefore, the C-terminal portion of S. pneumoniae HSP72,
which includes the region of maximum divergence among DnaK
proteins, is a target for the protective immune response.

As illustrated in Table 4 below, two independent
experiments demonstrated that SCID mice passively
transferred with rabbit anti-C-169rec antibodies were
protected from fatal infection with S. pneumoniae WU2. In
contrast, none of the 15 control mice survived. The
control mice received antibodies from nonimmune rabbit
sera or received sterile buffer alone. In addition, all
mice from the control groups had positive S. pneumoniae
hemoculture 24 hours post-challenge, while S. pneumoniae
organisms were detected in only 2 out of a total of 10
immunized SCID mice.



TABLE 4: PASSIVE IMMUNIZATION STUDIES SHOWING PROTECTION
OF SCID MICE FROM EXPERIMENTAL S. PNEUMONIAE INFECTION BY
ANTI-C-169rec RABBIT ANTIBODIES

                                 No. of Mice     No. of Mice Testing
                                 Surviving       Positive for the
                                 Challenge       Presence of
Experiment  Injection            after 5 days    S. pneumoniae

1           sterile buffer       0/5             5/5
            anti-C-169rec        4/5             2/5
            control antibodies   0/5             5/5
2           sterile buffer       0/5             5/5
            anti-C-169rec        5/5             0/5

In experiments 1 and 2 (Table 4), mice were
challenged with 5000 and 880 CFU of type 3 S. pneumoniae
strain WU2, respectively. Results in Table 4 are
expressed as the number of mice surviving challenge, or
testing positive for the presence of S. pneumoniae,
compared to the total number of mice in each group.
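
The pooled figures quoted in the text (2 of 10 immunized mice
with detectable organisms, none of 15 control mice surviving)
can be recomputed from the Table 4 entries. The Python sketch
below simply tallies the table; the tuple layout and the
function name pooled are illustrative choices, not part of the
original analysis.

```python
# (experiment, injection, survivors, positive_blood_cultures, group_size)
# Values transcribed from Table 4.
table4 = [
    (1, "sterile buffer", 0, 5, 5),
    (1, "anti-C-169rec", 4, 2, 5),
    (1, "control antibodies", 0, 5, 5),
    (2, "sterile buffer", 0, 5, 5),
    (2, "anti-C-169rec", 5, 0, 5),
]


def pooled(rows, immunized: bool):
    """Return (survivors, positive cultures, total mice) over both experiments."""
    selected = [r for r in rows if (r[1] == "anti-C-169rec") == immunized]
    survivors = sum(r[2] for r in selected)
    positive = sum(r[3] for r in selected)
    total = sum(r[4] for r in selected)
    return survivors, positive, total


print(pooled(table4, immunized=True))   # (9, 2, 10)
print(pooled(table4, immunized=False))  # (0, 15, 15)
```

The tally gives 9 of 10 passively immunized mice surviving,
against 0 of 15 controls.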

Demonstration of the anti-HSP72 specificity of
the antibody elicited by immunization with recombinant
HSP72 or C-169 proteins came from Western blot analyses
using S. pneumoniae cell lysates as antigens. A single
band corresponding to HSP72 was detected by all rabbit and
mouse antisera tested. These serologic results suggested
that the protection following the immunization with
recombinant proteins was due to the production of
antibodies reactive with S. pneumoniae HSP72.

EXAMPLE 6 - Heat-Inducible Expression System for High
Level Production of the C-151 Terminal Portion of the
HSP72 Protein

A. Construction of Plasmid pURV3 Containing the
C-151 Terminal Coding Region of the HSP72 of
S. pneumoniae

The DNA region coding for 151 amino acids at
the carboxyl end of the HSP72 of S. pneumoniae was
inserted downstream of the promoter λ PL into the
translation vector p629 [H. J. George et al.,
Bio/Technology 5, pp. 600-603 (1987)]. This vector
contains a cassette of the bacteriophage λ cI857
temperature-sensitive repressor gene from which the
functional PR promoter has been deleted. The inactivation
of the cI857 repressor by a temperature increase from the
range of 30-37°C to 37-42°C results in the induction of
the gene under the control of λ PL. The induction of gene
expression in E. coli cells by a temperature shift is
advantageous for large scale fermentation since it can
easily be achieved with modern fermenters. However, it
should be understood that while E. coli was the
microorganism of choice in the experiments herein
described, other host organisms, such as yeast, are
intended to be included within the scope of this
invention.

A fragment of 477 nucleotides, including the
region of 457 bases between positions 2050 and 2506 in the
HSP72 gene of S. pneumoniae (see SEQ ID NO:4), was amplified
by the polymerase chain reaction (PCR) from S. pneumoniae
type 6 strain 64 genomic DNA using the oligonucleotide
primers OCRR26 (5'-GGCAGATCTATGAAGGCCAAAGACCTTGGAAC)
and OCRR27 (5'-CGCGGATCCTTACTTTTCCGTAAACTCTCCGT).
Chromosomal DNA was prepared from a 90 ml culture of
exponentially growing cells of S. pneumoniae in heart
infusion broth using the method of Jayarao et al. [J.
Clin. Microbiol., 29, pp. 2774-2778 (1991)]. DNA
amplification reactions were made using a DNA Thermal
Cycler (Perkin Elmer, San Jose, CA). In OCRR26, an ATG
start codon is present in frame just upstream of the
coding region for the amino-terminal region of C-151.
The primers OCRR26 and OCRR27 contain, respectively, a
BglII (AGATCT) and a BamHI (GGATCC) recognition site in
order to facilitate the cloning of the PCR product into
the dephosphorylated restriction sites BglII and BamHI of
p629. The PCR product was purified from agarose gels by
the phenol freeze method [S. A. Benson, Biotechniques
2, pp. 67-68 (1984)] and digested with the restriction
enzymes BglII and BamHI. The BglII-BamHI fragment of 471
base pairs was then ligated into the dephosphorylated
BglII and BamHI recognition sites of p629. A partial map
of the resulting plasmid pURV3 is shown in FIG. 18. This
plasmid was transformed by the method of Simanis [Hanahan,
D. In D. M. Glover (ed.), DNA Cloning, pp. 109-135
(1985)] into the E. coli strain XL1-Blue MRF' (Δ(mcrA)183
Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1 gyrA96
relA1 lac [F' proAB lacIqZΔM15 Tn10 (Tetr)]), which was
obtained from Stratagene, La Jolla, CA. The transformants
grown at 37°C were screened by colony immunoblot [J.
Sambrook et al. (supra)] using the MAb F1-Pn3.1 reactive
with C-169rec. Plasmid DNA was purified from a selected
transformant and the DNA insert was sequenced by PCR using
the Taq Dye Deoxy Terminator Cycle Sequencing kit of
Applied Biosystems Inc. (ABI), and DNA electrophoresis was
performed on an automated DNA sequencer 373A (ABI). The
nucleotide sequence of the insert perfectly matched the
nucleotide sequence of the C-151 coding region of the
HSP72 gene. (See SEQ ID NO:25 and the corresponding amino
acid sequence at SEQ ID NO:26.) The plasmid was
transformed into the prototrophic E. coli strain W3110
(ATCC 27325) for the production of C-151rec.
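
The primer design described above can be inspected directly
from the two oligonucleotide sequences. The Python sketch
below locates the BglII and BamHI recognition sites, confirms
that an ATG immediately follows the BglII site in OCRR26, and
checks the length of the 2050-2506 region. It also looks for a
TAA triplet just before the BamHI site on the sense strand of
OCRR27; reading that triplet as a stop codon is an inference,
since the text does not describe it. All variable and function
names are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq))


# Primer sequences (5' to 3') as given in the text.
OCRR26 = "GGCAGATCTATGAAGGCCAAAGACCTTGGAAC"
OCRR27 = "CGCGGATCCTTACTTTTCCGTAAACTCTCCGT"
BGLII, BAMHI = "AGATCT", "GGATCC"

# Forward primer: BglII site followed directly by the ATG start codon.
bglii_at = OCRR26.find(BGLII)
atg_follows_site = OCRR26[bglii_at + len(BGLII):].startswith("ATG")

# Reverse primer: BamHI site, and what precedes it on the sense strand.
bamhi_at = OCRR27.find(BAMHI)
sense = reverse_complement(OCRR27)
taa_before_bamhi = sense[: sense.find(BAMHI)].endswith("TAA")

# Length of the amplified HSP72 region, inclusive coordinates.
region_length = 2506 - 2050 + 1

print(bglii_at, atg_follows_site, bamhi_at, taa_before_bamhi, region_length)
# 3 True 3 True 457
```

Both recognition sites start at the fourth nucleotide of their
primer, leaving three extra 5' bases ahead of each site.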

B. Expression of G-lSlrec and Antigen 
Preparation 

The recombinant C-151rec was synthesized with a
methionine residue at its amino end in E. coli strain
W3110 harboring the plasmid pURV3. E. coli cells were
grown at 30°C in LB broth containing 100 µg of ampicillin
per ml until the A500 reached a value of 0.6. The cells
were then cultivated at 40°C for 18 hours to induce the
production of C-151rec protein. A semi-purified C-151rec
protein was prepared using the following procedure. The
bacterial cells were harvested by centrifugation and the
resulting pellet was washed and resuspended in
phosphate-buffered saline. Lysozyme was added and the cells
were incubated for 15 min on ice before disruption by pulse
sonication. The cell lysates were cleared by
centrifugation and the supernatants were collected and
subjected to separation using Amicon ultrafiltration
equipment (stirred cells series 8000, Amicon Canada Ltd.,
Oakville, Ontario). The ultrafiltrate not retained by a
YM30 membrane was recovered, analysed by SDS-PAGE and
stained with Coomassie blue R-250. Protein concentrations
were estimated by comparing the staining intensity of the
C-151rec protein with those obtained with defined
concentrations of soybean trypsin inhibitor.
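
The comparison of staining intensities against defined standards can be sketched as a simple linear standard curve. The snippet below is a minimal illustration only: the intensity values, the amounts of standard, and the assumption that band intensity is linear in protein amount over the range used are all hypothetical and are not taken from the specification.

```python
# Minimal sketch: estimate a protein amount from band staining intensity
# using a straight-line standard curve (hypothetical numbers throughout).

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_amount(sample_intensity, standards):
    """Invert the standard curve to read an amount from an intensity.

    standards: list of (amount_in_ug, band_intensity) pairs for the
    reference protein (here standing in for soybean trypsin inhibitor).
    """
    amounts = [amount for amount, _ in standards]
    intensities = [intensity for _, intensity in standards]
    slope, intercept = fit_line(amounts, intensities)
    return (sample_intensity - intercept) / slope


# Hypothetical standards: amount loaded (ug) versus densitometric signal.
STANDARDS = [(0.25, 1000.0), (0.5, 2000.0), (1.0, 4000.0), (2.0, 8000.0)]

# Hypothetical sample band intensity.
estimated_ug = estimate_amount(3000.0, STANDARDS)
print(f"estimated amount: {estimated_ug:.2f} ug")
```

In practice the linear range of Coomassie staining is limited, so a real estimate would only use standards that bracket the sample band.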


C. Reactivity of MAbs Against C-151rec

A panel of 10 monoclonal antibodies selected
for their reactivity with the S. pneumoniae HSP72 protein
was tested for reactivity to C-151rec by Western
blot analysis using YM30 ultrafiltrates prepared as
described above. The MAbs included a series of six
monoclonal antibodies raised to the HSP72rec protein
(F3-Pn3.5 to F3-Pn3.10) and monoclonal antibodies F1-Pn3.1,
F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4. The three MAbs F1-Pn3.1,
F2-Pn3.3 and F2-Pn3.4 that were reactive with C-169rec also
recognized the C-151rec fragment. All other MAbs were
reactive only with HSP72rec, indicating that they may be
directed against epitopes present in the amino-terminal
region of the HSP72 protein.

EXAMPLE 7 - Antibody Response of Balb/c Mice and Macaca
fascicularis (cynomolgus) Monkeys to Recombinant HSP72
Antigens

A. Procedures
1. Immunization of Animals

Groups of 10 female Balb/c mice were immunized
subcutaneously with either HSP72rec or C-169rec as
described in Example 5. In order to assess the antibody
response to C-151rec, a group of 6 mice was immunized
three times at two-week intervals with 0.5 µg of C-151rec
adsorbed to Alhydrogel adjuvant by intraperitoneal
injection. Sera from blood samples collected prior to each
immunization and four to seven days after the third
immunization were tested for antibody reactive with S.
pneumoniae by ELISA using plates coated with S. pneumoniae
cell wall extracts.

Female cynomolgus monkeys were immunized
intramuscularly at Days 1, 22 and 77 with 0.5 ml containing
150 µg of purified HSP72rec or C-169rec antigens adsorbed
to Alhydrogel adjuvant. Blood samples were collected
regularly before and after each immunization and the sera
were tested for antibody reactive with S. pneumoniae HSP72
antigen by Western blot analysis.

The specificity of the raised antibodies for S.
pneumoniae HSP72 was confirmed by Western blot analyses
against S. pneumoniae cell extracts and purified
recombinant antigens.



B. Results 

The results previously described in Example 5
clearly demonstrate the protective nature of the antibody
response elicited following immunization with recombinant
HSP72 antigens. Here we monitored the appearance of the
serum antibody response in mice (FIGS. 19, 20 and 21) and in
monkeys (FIG. 22) during the immunization schedule. Both
species responded strongly to the full-length and
truncated recombinant HSP72 proteins used as immunogens,
with average titers of 1:64000 after the third injection.
Detailed analysis of individual sera revealed that each
animal responded to the immunization by developing
antibodies reactive with S. pneumoniae HSP72.

In mice immunized with C-169rec, the doses
tested, i.e. 1 and 5 µg, were similarly efficient and
induced similar antibody titers (FIG. 20). A strong
boost response was observed after the second injection
with C-169rec, with no enhancement in the antibody titers
after a third injection. In contrast, we observed
that the immune response to HSP72rec was
dose-dependent. Increases in the specific antibody titers
were observed after a second and a third injection with
either HSP72rec or C-151rec (FIGS. 19 and 21).
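
Titers such as 1:64000 are conventionally read as the highest serum dilution that still gives a signal above a cutoff, and group averages are often reported as geometric means. The sketch below illustrates that convention under stated assumptions: the dilution series, optical densities and cutoff are hypothetical, and the specification does not state which averaging method was used.

```python
import math

# Minimal sketch: endpoint titer and geometric mean titer (GMT).
# All readings and the cutoff below are hypothetical.


def endpoint_titer(readings, cutoff):
    """Return the highest dilution factor whose OD is at or above cutoff.

    readings: list of (dilution_factor, optical_density) pairs.
    Returns None if no dilution is positive.
    """
    positive = [dilution for dilution, od in readings if od >= cutoff]
    return max(positive) if positive else None


def geometric_mean_titer(titers):
    """Geometric mean of a list of positive titers."""
    return math.exp(sum(math.log(t) for t in titers) / len(titers))


# Hypothetical two-fold dilution series for one serum.
SERUM = [
    (1000, 2.10), (2000, 1.85), (4000, 1.50), (8000, 1.10),
    (16000, 0.70), (32000, 0.40), (64000, 0.22), (128000, 0.08),
]

titer = endpoint_titer(SERUM, cutoff=0.20)
gmt = geometric_mean_titer([32000, 64000, 128000])
print(f"endpoint titer 1:{titer}, group GMT 1:{round(gmt)}")
```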

Study of the immune response of monkeys clearly
indicated that the immunogenicity of recombinant HSP72
antigens is not restricted to small laboratory animals such
as rabbit and mouse. The humoral response following the
second injection with either antigen is characterized by a
strong increase in HSP72-specific antibody titers that can
persist for several weeks without any detectable decrease
(FIG. 22). In addition, specific
serum antibodies were detectable in the sera of each
monkey after a single injection of recombinant antigens.

EXAMPLE 8 - B-Cell Epitope Mapping of HSP72 Stress
Protein

In Example 3, it was shown that significant
variability in the primary sequence of the HSP70 proteins
was mainly localized to two regions corresponding to amino
acid residues 244 to 330 and 510 to 607 of the S.
pneumoniae HSP72 protein. These variable regions may
contain B-cell epitopes responsible for the antigenic
heterogeneity reported in Example 4. To investigate this
possibility, the reactivity of polyclonal and monoclonal
antibodies to S. pneumoniae HSP72 was tested against
fourteen peptides selected to cover most of these regions.

A. Procedures

Fourteen peptides of 14 to 30 amino acid
residues were synthesized. The peptide sequences and
their locations in the protein are summarized in Table 5.
Peptides CS870, CS873, CS874, CS875, CS876, CS877, CS878,
CS879, CS880 and CS882 were synthesized by Biochem
Immunosystem Inc. (Montreal, Canada) using an automated
peptide synthesizer. Peptides MAP1, MAP2, MAP3 and MAP4
were synthesized onto a branching lysine core as Multiple
Antigenic Peptides (MAP) by the Service de Séquence de
Peptides de l'Est du Québec, Centre de recherche du CHUL
(Sainte-Foy, Canada). Peptides were purified by
reverse-phase high-pressure liquid chromatography. Peptides
were solubilized in distilled water except for peptides
CS874 and CS876, which were solubilized in a small volume of
either 6M guanidine-HCl or dimethyl sulfoxide and then
adjusted to 1 mg/ml with distilled water.

Peptide ELISAs were performed by coating synthetic
peptides onto Immulon 4 microtitration plates (Dynatech
Laboratories, Inc., Chantilly, VA) at a concentration of
50 µg/ml according to the procedures described in J. Hamel
et al. [supra]. To confirm the reactivity of MAbs with
peptides, the ability of fluid-phase peptides to inhibit
MAb binding to solid-phase HSP72 was determined. For the
inhibition assay, microtitration plates were coated with
S. pneumoniae cell wall extracts. Hybridoma culture
supernatants containing the HSP72-specific MAbs were
incubated overnight at 4°C with several concentrations of
peptide. Peptide-treated and control supernatants were
then tested by ELISA as described above.
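
The outcome of a fluid-phase inhibition assay of this kind is usually expressed as percent inhibition of the control signal. The following is a minimal sketch under stated assumptions: the optical densities are hypothetical, and subtracting a blank well is an assumed convention that the specification does not describe.

```python
# Minimal sketch: percent inhibition of MAb binding by a fluid-phase
# peptide, computed from ELISA optical densities (hypothetical values).


def percent_inhibition(od_control, od_with_peptide, od_blank=0.0):
    """Percent reduction of the blank-corrected signal.

    od_control: OD for the supernatant incubated without peptide.
    od_with_peptide: OD for the peptide-treated supernatant.
    od_blank: background OD (assumed correction, not from the source).
    """
    control_signal = od_control - od_blank
    if control_signal <= 0:
        raise ValueError("control signal must exceed the blank")
    peptide_signal = od_with_peptide - od_blank
    return 100.0 * (1.0 - peptide_signal / control_signal)


# Hypothetical readings at increasing peptide concentrations (ug/ml).
readings = {0.1: 0.90, 1.0: 0.50, 10.0: 0.05}
for concentration, od in readings.items():
    value = percent_inhibition(1.05, od, od_blank=0.05)
    print(f"{concentration:>5} ug/ml: {value:.0f}% inhibition")
```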

Immune sera were from animals immunized three
times with recombinant HSP72 antigens. One rabbit was
immunized with 37.5 µg of purified HSP72rec according to
the immunization protocol described in Example 5. Pooled
murine sera were from three Balb/c mice immunized with
HSP72rec in Example 5, and pooled monkey sera were from
groups of two animals immunized with either HSP72rec or
C-169rec.

TABLE 5: SEQUENCES AND LOCATIONS OF SYNTHETIC PEPTIDES
CORRESPONDING TO S. PNEUMONIAE HSP72 AMINO ACID RESIDUES

Peptide   Location   Sequence                          SEQ ID NO.
CS876     247-261    TSTQISLPFITAGEA                   7
CS877     257-271    TAGEAGPLHLEMTLT                   8
CS878     268-281    MTLTRAKFDDLTRD                    9
CS879     276-290    DDLTRDLVERTKVPV                   10
CS880     286-299    TKVPVRQALSDAGL                    11
CS882     315-333    RIPAVVEAYKAETGKEPNK               23
CS873     457-471    KAKDLGTQKEQTIVI                   12
CS874     467-481    QTIVIQSNSGLTDEE                   24
CS875     477-491    LTDEEIDRMMKDAEA                   13
MAP1      487-510    KDAEANAESDKKRKEEVDLRNEVD          14
CS870     507-521    NEVDQAIFATEKTIK                   15
MAP2      517-544    EKTIKETEGKGFDAERDAAQAALDDLKK      16
MAP3      544-573    KAQEDNNLDDMKAKLEALNEKAQGLAVKLY    17
MAP4      583-607    QEGAEGAQATGNAGDDVVDGEFTEK         [illegible]
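
The peptide coordinates in Table 5 can be sanity-checked by confirming that each sequence length matches its residue range and that overlapping peptides agree over their shared residues. The sketch below is illustrative only and includes just those rows whose sequences are legible without ambiguity in the source; the helper names are hypothetical.

```python
# Minimal sketch: consistency checks on peptide locations and sequences.
# Rows are (name, first_residue, last_residue, sequence).

PEPTIDES = [
    ("CS876", 247, 261, "TSTQISLPFITAGEA"),
    ("CS877", 257, 271, "TAGEAGPLHLEMTLT"),
    ("CS878", 268, 281, "MTLTRAKFDDLTRD"),
    ("CS879", 276, 290, "DDLTRDLVERTKVPV"),
    ("CS880", 286, 299, "TKVPVRQALSDAGL"),
    ("CS873", 457, 471, "KAKDLGTQKEQTIVI"),
    ("CS874", 467, 481, "QTIVIQSNSGLTDEE"),
    ("MAP1", 487, 510, "KDAEANAESDKKRKEEVDLRNEVD"),
    ("CS870", 507, 521, "NEVDQAIFATEKTIK"),
]


def length_matches(peptide):
    """True if the sequence length equals the size of the residue range."""
    _, first, last, sequence = peptide
    return len(sequence) == last - first + 1


def overlap_consistent(left, right):
    """True if two peptides agree on the residues they share.

    Assumes `right` starts at or after `left` and extends past its end.
    """
    _, _, left_last, left_seq = left
    _, right_first, _, right_seq = right
    shared = left_last - right_first + 1
    if shared <= 0:
        return True  # no shared residues, nothing to compare
    return left_seq[-shared:] == right_seq[:shared]


lengths_ok = all(length_matches(p) for p in PEPTIDES)
overlaps_ok = all(
    overlap_consistent(a, b) for a, b in zip(PEPTIDES, PEPTIDES[1:])
)
print("lengths consistent:", lengths_ok)
print("overlaps consistent:", overlaps_ok)
```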



B. Identification and Localization of Linear B-Cell
Epitopes

The results presented in FIG. 23 revealed that
most of the immunological reactivity was observed with the
peptides localized between amino acid residues 457 and 607,
corresponding to the C-151 fragment of HSP72. Rabbit,
mouse and monkey serum antibodies from animals immunized
with either recombinant HSP72rec or C-169rec were reactive
with both peptide MAP2 and peptide MAP4. Interestingly, the
sequences of peptides MAP2 and MAP4 span the
hypervariable carboxyl-terminal region containing the
sequences GFDAERDAAQAALDD (residues 527 to 541) and
AEGAQATGNAGDDVV (residues 586 to 600) defined as exclusive
to S. pneumoniae HSP72 based on the comparison of HSP70
protein sequences available in the data banks. Our data
thus revealed that both peptide sequences contain linear
B-cell epitopes. In addition, the peptide MAP4 alone was
also recognized by the MAb F1-Pn3.1. This reactivity was
confirmed by fluid-phase inhibition assays in which 10
µg/ml of MAP4 caused complete inhibition of F1-Pn3.1
binding to HSP72. Polyclonal antisera from animals
immunized with the complete HSP72 recombinant protein also
recognized B-cell epitopes localized on peptides CS875,
MAP1 and MAP3. Altogether, these data indicate that the
hypervariable carboxyl-terminal C-151 fragment of HSP72
stimulates B-cell responses and possibly constitutes the
immunodominant portion of the HSP72 protein. The lack of
reactivity of MAbs F2-Pn3.3 and F2-Pn3.4 with the
synthetic peptides suggests that they react with
conformational determinants present on the C-terminal
region of HSP72. The existence of protective epitopes
in the C-151 region was strongly suggested in Example 5,
where mice immunized with purified C-169rec were protected
from fatal infection with a virulent strain of S.
pneumoniae, thus suggesting that the carboxyl-terminal
fragments C-169 or C-151 of S. pneumoniae HSP72, or even
smaller fragments thereof, may prove very useful for the
development of a future vaccine.

The variable region comprised within amino
acid residues 244 to 330 also constitutes an antigenic
domain. Linear epitopes located on the overlapping peptides
CS877 (amino acids 257 to 271) and CS878 (amino acids 268
to 281), on peptide CS880 (amino acids 286 to 299) and on
peptide CS882 (amino acids 315 to 333) were identified by
hyperimmune sera.

EXAMPLE 9 - HSP70 (DnaK) from Streptococcus pyogenes and
Streptococcus agalactiae: Molecular Cloning and DNA
Sequencing of the hsp70 Genes; Nucleotide and Protein
Sequence Analyses; Antigenic Relatedness to S. pneumoniae;
Increased Streptococcus agalactiae HSP70 Synthesis in
Response to Heat

A. Procedures

1. Bacterial Strains and Plasmid Vector

The strains of S. pyogenes (Group A
Streptococcus) and S. agalactiae (Group B Streptococcus)
used in this study were provided by the Laboratoire de la
Santé Publique du Québec (LSPQ), Sainte-Anne de Bellevue,
Quebec, Canada. S. agalactiae type II strain V8
corresponds to the ATCC strain 12973. S. pyogenes strain
Bruno corresponds to the ATCC strain 19615. The E. coli
strain XL1-Blue MRF' was obtained from Stratagene.

Streptococcal strains were grown at 37°C in a
5 % CO2 incubator. The streptococci were streaked on
tryptic soy agar plates containing 5 % sheep blood (Les
Laboratoires Quelab, Montreal, Canada); liquid cultures
were made in heart infusion broth (Difco Laboratories,
Detroit, MI) without agitation. The E. coli strain was
grown at 37°C in L-broth with agitation at 250 rpm or on
L-agar.

The general cloning phagemid pBluescript KS(-)
was purchased from Stratagene.

2. Recombinant DNA Techniques

Restriction enzymes, T4 DNA ligase, and calf
intestinal phosphatase were used as recommended by the
suppliers (Pharmacia [Canada] Inc., Baie d'Urfé, Canada;
and New England Biolabs Ltd., Mississauga, Canada).
Preparation of plasmids by equilibrium centrifugation in
CsCl-ethidium bromide gradients, agarose gel
electrophoresis of DNA fragments, Southern hybridization,
and colony DNA hybridization were performed as described
by J. Sambrook et al. [supra]. Chromosomal DNA of the
streptococcal bacteria was prepared using the procedure of
B. M. Jayarao et al. [J. Clin. Microbiol., 29, pp. 2774-
2778 (1991)] adapted for bacterial cultures of 90 ml.
Rapid plasmid preparations were made according to D.
Ish-Horowicz et al. [Nucl. Acids Res. 9, pp. 2989-2998
(1981)]. Plasmids used for DNA sequencing were purified
using plasmid kits from Qiagen Inc. (Chatsworth, CA). DNA
fragments were purified from agarose gels by the
phenol-freeze method [S. A. Benson, Biotechniques 2, pp.
67-68 (1984)]. DNA probes were labeled with α-32P-dCTP or
digoxigenin (DIG)-11-dUTP using the random primer labeling
kits of Boehringer Mannheim (Laval, Canada). Plasmid
transformations were carried out by the method of Simanis
[Hanahan, D., In D. M. Glover (ed.), DNA Cloning, pp. 109-
135 (1985)]. The sequencing of genomic DNA inserts in
plasmids was done using synthetic oligonucleotides. The
sequencing reactions were carried out by the polymerase
chain reaction (PCR) using the Taq Dye Deoxy Terminator
Cycle Sequencing kit (ABI) and DNA electrophoresis was
performed on an automated DNA sequencer 373A (ABI). The
assembly of the DNA sequence was performed using the
program Sequencher 3.0 from the Gene Codes Corporation
(Ann Arbor, MI). Analyses of the DNA sequences and their
predicted polypeptides were performed with the program
GeneWorks version 2.45 from IntelliGenetics, Inc.
(Mountain View, CA). DNA amplification reactions were
made using a DNA Thermal Cycler 480 (Perkin Elmer).
Oligonucleotides were synthesized on an oligonucleotide
synthesizer model 394 (ABI).

3. Molecular Cloning of the hsp70/dnaK Genes
of S. agalactiae and S. pyogenes

Chromosomal DNA from S. agalactiae and S.
pyogenes was digested to completion with various
restriction enzymes with palindromic hexanucleotide
recognition sequences. The digests were analysed by
Southern hybridization using a labeled PCR-amplified DNA
probe corresponding to a 782 base-pair region starting at
base 332 downstream from the ATG initiation codon of the
HSP72 gene of S. pneumoniae (see SEQ ID NO: 4). This DNA
region was selected because it is relatively well
conserved among the hsp70 genes of Gram-positive bacteria
that have been characterized. The PCR amplification was
done on the genomic DNA of S. pneumoniae using the
oligonucleotides OCRR2 (5'-AAGCTGTTATCACAGTTCCGG) and
OCRR3 (5'-GATACCAAGTGACAATGGCG). Hybridizing genomic
restriction fragments of sufficient size to code for a
70-kDa polypeptide (>1.8 kb) were partially purified by
extraction of genomic fragments of corresponding size from
agarose gel. Verification of the presence of the hsp70
gene among the purified genomic restriction fragments was
done by Southern hybridization using the labeled 782-bp S.
pneumoniae DNA probe.
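
Basic properties of the two amplification primers, such as length, GC content and the sequence of the complementary strand, can be checked with a few lines of code. This is a minimal sketch: the primer sequences are those given above, while the helper functions are illustrative and not part of the described method.

```python
# Minimal sketch: simple checks on the OCRR2 and OCRR3 primers.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.translate(COMPLEMENT)[::-1]


def gc_fraction(sequence):
    """Fraction of bases that are G or C."""
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


PRIMERS = {
    "OCRR2": "AAGCTGTTATCACAGTTCCGG",
    "OCRR3": "GATACCAAGTGACAATGGCG",
}

for name, sequence in PRIMERS.items():
    print(
        f"{name}: {len(sequence)} nt, "
        f"GC {gc_fraction(sequence):.0%}, "
        f"reverse complement 5'-{reverse_complement(sequence)}"
    )
```

If OCRR3 is the reverse primer, as its position in the text suggests, its reverse complement is the sequence expected on the coding strand at the downstream end of the 782-bp probe; the specification does not state primer orientation explicitly, so that reading is an assumption.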

The purified genomic DNA restriction fragments
were cloned into dephosphorylated compatible restriction
sites of pBluescript KS(-) and transformed into the E.
coli strain XL1-Blue MRF'. The colonies were screened by
DNA hybridization using the labeled 782-bp S. pneumoniae
DNA probe. Extracted plasmids were digested with various
restriction enzymes to evaluate the size of the inserts
and to verify the presence of the hsp70 gene by Southern
hybridization using the labeled 782-bp S. pneumoniae DNA
probe. Plasmid pURV5 contains a 4.2-kb HindIII insert of
the genomic DNA of S. agalactiae. Plasmid pURV4 contains
a 3.5-kb HindIII fragment of the genomic DNA of S.
pyogenes.

4. Heat Shock and Protein Labeling

The stress response of S. agalactiae to a heat
shock was assayed by pulse-labeling with [35S]methionine
as described in Example 1. S. agalactiae bacteria
grown overnight in SMAM (Methionine Assay Medium
supplemented with 1 mg/l methionine, 1% (v/v) Isovitalex
and 1 mg/l choline chloride) were pelleted by
centrifugation and then resuspended in methionine-free
SMAM medium. The bacteria were incubated at 37°C for 1 h
and then divided into two fractions of equal volume. The
samples were incubated at either 37°C or 43°C for 10
minutes and then labeled with 100 µCi/ml [35S]methionine
for 30 minutes at 37°C. The bacteria were extensively
washed with PBS and cell extracts were prepared by
treatment with mutanolysin and lysozyme as described for
the DNA isolation (B. M. Jayarao et al., supra) followed by
sonication.

5. Immunological Characterization

A series of six monoclonal antibodies raised to
the HSP72rec protein (F3-Pn3.5 to F3-Pn3.10) and the
monoclonal antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and
F2-Pn3.4 were tested for their reactivity to HSP70 antigens
from S. pyogenes and S. agalactiae by Western blot
analysis. Cell lysates from S. pyogenes and S. agalactiae
were obtained by treatment with mutanolysin and
lysozyme (B. M. Jayarao et al., supra), sonication and
boiling in SDS-PAGE sample buffer. Cell lysates from E.
coli transformed with either pURV4 or pURV6, producing
truncated S. pyogenes HSP70 antigens, were tested after
boiling in SDS-PAGE sample buffer.

B. DNA Sequence Analysis of the hsp70/dnaK Genes
of Streptococcus pyogenes, Streptococcus agalactiae and
Streptococcus pneumoniae

A region of 2438 bases in the 4.2-kb HindIII
insert of plasmid pURV5 was sequenced. This sequence
contains an open reading frame (ORF) of 1830 nucleotides
coding for a polypeptide of 609 amino acids with a
molecular weight of 64907 (see SEQ ID NO: 7). The ORF has
an ATG start codon beginning at position 248 and a TAA stop
codon ending at position 2077. The ATG start codon is
preceded by the sequence GAGG, starting at position 237,
which is complementary to 16S rRNA and serves as a
ribosome binding site in E. coli [G. D. Stormo et al.,
Nucleic Acids Res. 10, pp. 2971-2996 (1982)]. The ORF and
the polypeptide of the HSP70 of S. agalactiae are,
respectively, 85 % and 95 % identical to the ORF and
polypeptide of the HSP72 of S. pneumoniae.

Preliminary sequence comparisons with the HSP72
of S. pneumoniae showed that the 3.5-kb HindIII insert in
plasmid pURV4 lacks the 3'-end coding region of the hsp70
of S. pyogenes. An attempt to clone a 3-kb SalI genomic
fragment containing the entire coding region of hsp70 of
S. pyogenes yielded plasmid pURV6 containing a 3.1-kb
insert lacking the 5'-end coding region of the gene. The
assembly of the hsp70 gene regions present in plasmids
pURV4 and pURV6 gave a 2183 nucleotide region containing
an ORF of 1824 bases coding for a polypeptide of 608 amino
acids with a molecular weight of 64847 (see SEQ ID NO:
20). The ATG start codon begins at position 204 and the
TAA stop codon extends to position 2030. As in the
hsp70 of S. agalactiae, the ATG start codon is preceded
by a putative ribosome binding site sequence GAGG starting
at position 193 [G. D. Stormo, supra]. The ORF and the
deduced polypeptide of the hsp70 of S. pyogenes are,
respectively, 85 % and 94 % identical to the ORF and
polypeptide of the HSP72 of S. pneumoniae. The ORF of
plasmid pURV4 lacks 125 base pairs coding for 41 amino
acids at the carboxyl end of the HSP70 of S. pyogenes;
the ORF thus codes for the 567 amino acids of the amino
end of that HSP70 (N-567rec). The ORF of plasmid pURV6
lacks 114 base pairs coding for 38 amino acids at the
amino end of the HSP70 of S. pyogenes; the ORF thus codes
for the 570 amino acids of the carboxyl end of that HSP70
(C-570rec).
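
The relationship between the quoted start and stop positions, ORF lengths and polypeptide lengths can be verified with simple arithmetic. The sketch below is illustrative: it assumes that positions are 1-based and inclusive and that the quoted end position is the last base of the stop codon, which is how the text reads but is not stated as a rule.

```python
# Minimal sketch: ORF length and residue count from start/stop positions.
# Assumes 1-based inclusive coordinates ending on the stop codon.


def orf_summary(start, stop_end):
    """Return nucleotides, codons and encoded residues for an ORF."""
    length_nt = stop_end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length_nt // 3
    return {
        "length_nt": length_nt,      # includes the stop codon
        "codons": codons,            # includes the stop codon
        "residues": codons - 1,      # stop codon encodes no residue
    }


agalactiae = orf_summary(248, 2077)
pyogenes = orf_summary(204, 2030)
print("S. agalactiae:", agalactiae)
print("S. pyogenes:  ", pyogenes)

# Truncated S. pyogenes products described in the text.
n_fragment = pyogenes["residues"] - 41   # pURV4 lacks 41 C-terminal residues
c_fragment = pyogenes["residues"] - 38   # pURV6 lacks 38 N-terminal residues
print("N-terminal fragment:", n_fragment, "residues")
print("C-terminal fragment:", c_fragment, "residues")
```

Under these assumptions the S. agalactiae coordinates give 1830 nucleotides and 609 residues, as stated. The S. pyogenes coordinates give 1827 nucleotides including the stop codon, which corresponds to the quoted 1824 bases if the stop codon is not counted, and 608 residues either way.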

The global comparison of the DNA open reading frames
(FIG. 24) and amino acid sequences (FIG. 25) of the
HSP70/DnaK of S. pyogenes, S. agalactiae, and S. pneumoniae
gave percentages of identity of 82 and 93 %, respectively.
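
Percent identity figures of this kind depend on how aligned positions are counted. The sketch below shows one common convention, identical positions divided by aligned columns that contain at least one residue; it is not the GeneWorks algorithm, and the short sequences are hypothetical placeholders rather than HSP70 data.

```python
# Minimal sketch: percent identity between two pre-aligned sequences.
# Gaps are written as "-"; columns where both sequences have a gap
# are ignored. This convention is an assumption, not the source's.


def percent_identity(aligned_a, aligned_b):
    """Percent of compared columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == "-" and residue_b == "-":
            continue
        compared += 1
        if residue_a == residue_b:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


# Hypothetical aligned fragments.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))  # one mismatch in ten
print(percent_identity("AC-EF", "ACDEF"))            # one gap in five
```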

C. Increased Synthesis of HSP70 by S. agalactiae
in Response to Heat

One-dimensional SDS-polyacrylamide gel
electrophoretic analysis of cell extracts of heat-shocked
and control S. agalactiae pulse-labeled with
[35S]methionine revealed that the synthesis of a 70-kDa
protein was significantly increased after a thermal stress
(FIG. 26, lanes 1 and 2). Radioimmunoprecipitation
analysis revealed that the heat-inducible 70-kDa protein
was easily detected at 43°C using monoclonal antibody
F2-Pn3.4, thus indicating that the protein belongs to the
heat shock protein 70 (hsp70/DnaK) family (FIG. 26, lanes 3
and 4).

D. Antigenic Relatedness of HSP70 Proteins in
S. pneumoniae, S. pyogenes and S. agalactiae

In this study, a panel of MAbs was used to
investigate the antigenic relatedness of S. pyogenes, S.
agalactiae and S. pneumoniae HSP70 proteins. Eight of ten
MAbs reacted with all three Streptococcus species, thus
indicating that some B-cell epitopes are widely
distributed among S. pneumoniae, S. pyogenes and S.
agalactiae. The MAb F1-Pn3.1, which is directed against an
epitope located between amino acid residues 584 and 607 of
HSP72 from S. pneumoniae, did not react with HSP70
antigens from either S. pyogenes or S. agalactiae.
Comparison of this region among the three Streptococcus
species revealed differences in 5 to 8 amino acids located
between amino acids 589 and 596. The MAb F2-Pn3.3, which
is also directed against epitopes present in the C-151
region, was reactive with S. agalactiae but not with S.
pyogenes. These data clearly indicate that HSP70 proteins
from Streptococcus species are structurally and
immunologically related. There are, however, immunological
distinctions among them.

Analysis of the reactivity of MAbs F3-Pn3.5,
F3-Pn3.6, F3-Pn3.7 and F3-Pn3.10 with truncated recombinant
S. pyogenes HSP70 antigens allowed the identification of
an antigenic region near the amino-terminal end of the S.
pneumoniae HSP72. These MAbs reacted with constructs
expressing the N-terminal 567 amino acid residues but
failed to react with constructs expressing the C-570
fragment. These data localized the epitopes recognized by
the MAbs F3-Pn3.5, F3-Pn3.6, F3-Pn3.7 and F3-Pn3.10 to
between residues 1 and 38 of the HSP72 protein.
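
The localization argument is a set difference on residue ranges: the epitope must lie in every fragment that reacts and outside every fragment that does not. The sketch below illustrates this reasoning with the fragment boundaries given in the text (N-567 covering residues 1 to 567 and C-570 covering residues 39 to 608 of the 608-residue S. pyogenes HSP70). It assumes a linear epitope confined to a single contiguous region, and the function name is hypothetical.

```python
# Minimal sketch: localize an epitope from reactive and non-reactive
# truncated fragments, each given as (first_residue, last_residue).


def localize_epitope(reactive, nonreactive):
    """Return the (first, last) residues compatible with the reactivity.

    Keeps residues present in every reactive fragment and absent from
    every non-reactive fragment. Returns None if nothing remains.
    Assumes the remaining residues form one contiguous stretch.
    """
    candidates = set.intersection(
        *(set(range(first, last + 1)) for first, last in reactive)
    )
    for first, last in nonreactive:
        candidates -= set(range(first, last + 1))
    if not candidates:
        return None
    return min(candidates), max(candidates)


N_567 = (1, 567)     # product of pURV4
C_570 = (39, 608)    # product of pURV6

region = localize_epitope(reactive=[N_567], nonreactive=[C_570])
print("epitope region:", region)
```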

EXAMPLE 10 - Use of HSP70/HSP72 As A Human Vaccine

To formulate a vaccine for human use,
appropriate HSP72 antigens may be selected from the
polypeptides described herein. For example, one of skill
in the art could design a vaccine around the HSP70/HSP72
polypeptide or fragments thereof containing an immunogenic
epitope. The use of molecular biology techniques is
particularly well-suited for the preparation of
substantially pure recombinant antigens.

The vaccine composition may take a variety of
forms. These include, for example, solid, semi-solid and
liquid dosage forms, such as powders, liquid solutions or
suspensions, and liposomes. Based on our belief that the
HSP70/HSP72 antigens of this invention may elicit a
protective immune response when administered to a human,
the compositions of this invention will be similar to
those used for immunizing humans with other proteins and
polypeptides, e.g. tetanus and diphtheria. Therefore, the
compositions of this invention will preferably comprise a
pharmaceutically acceptable adjuvant such as incomplete
Freund's adjuvant, aluminum hydroxide, a muramyl peptide,
a water-in-oil emulsion, a liposome, an ISCOM or CTB, or a
non-toxic B subunit from cholera toxin. Most preferably,
the compositions will include a water-in-oil emulsion or
aluminum hydroxide as adjuvant.

The composition would be administered to the
patient by any of a number of pharmaceutically acceptable
routes, including intramuscular, intradermal, subcutaneous
or topical. Preferably, the vaccine will be administered
intramuscularly.

Generally, the dosage will consist of an initial 
injection, most probably with adjuvant, of about 0.01 to 
10 mg, and preferably 0.1 to 1.0 mg HSP72 antigen per 
patient, followed most probably by one or more booster 
injections. Preferably, boosters will be administered at 
about 1 and 6 months after the initial injection. 

An important consideration relating to
pneumococcal vaccine development is the question of
mucosal immunity. The ideal mucosal vaccine will be
safely taken orally or intranasally as one or a few doses
and would elicit protective antibodies on the appropriate
surfaces along with systemic immunity. The mucosal
vaccine composition may include adjuvants, inert
particulate carriers or recombinant live vectors.

The anti-HSP72 antibodies of this invention are 
useful for passive immunotherapy and immunoprophylaxis of 
humans infected with S. pneumoniae, S. pyogenes, S. 
agalactiae or related bacteria. The dosage forms and 
regimens for such passive immunization would be similar to 
those of other passive immunotherapies. 

An antibody according to this invention is
exemplified by the MAb F1-Pn3.1, produced by a hybridoma
deposited in the American Type Culture Collection in
Rockville, Maryland, USA on July 21, 1995, and identified
as Murine Hybridoma Cell Line, F1-Pn3.1. This deposit was
assigned accession number HB 11960.

While we have described herein a number of
embodiments of this invention, it is apparent that our
basic embodiments may be altered to provide other
embodiments that utilize the compositions and processes of
this invention. Therefore, it will be appreciated that
the scope of this invention includes all alternative
embodiments and variations that are defined in the
foregoing specification and by the claims appended hereto;
and the invention is not to be limited by the specific
embodiments which have been presented herein by way of
example.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Hamel, Josee
               Brodeur, Bernard R
               Martin, Denis
               Rioux, Clement

(ii) TITLE OF INVENTION: STREPTOCOCCAL HEAT SHOCK PROTEINS
     MEMBERS OF THE HSP70 FAMILY

(iii) NUMBER OF SEQUENCES: 26

(iv) CORRESPONDENCE ADDRESS:
     (A) ADDRESSEE: Goudreau Gage Dubuc & Martineau Walker
     (B) STREET: 800 Place Victoria, Suite 3400, Stock
         Exchange Tower
     (C) CITY: Montreal
     (D) STATE: Quebec
     (E) COUNTRY: CANADA
     (F) ZIP: H4Z 1E9

(v) COMPUTER READABLE FORM:
     (A) MEDIUM TYPE: Floppy disk
     (B) COMPUTER: IBM PC compatible
     (C) OPERATING SYSTEM: PC-DOS/MS-DOS
     (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
     (A) APPLICATION NUMBER:
     (B) FILING DATE:
     (C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
     (A) APPLICATION NUMBER: US 08/472,534
     (B) FILING DATE: 07-JUN-1995

(vii) PRIOR APPLICATION DATA:
     (A) APPLICATION NUMBER: US (PROVIS) 60/001,805
     (B) FILING DATE: 04-AUG-1995

(viii) ATTORNEY/AGENT INFORMATION:
     (A) NAME: Leclerc/Dubuc/Prince, Alain/Jean/Gaetan
     (C) REFERENCE/DOCKET NUMBER: BIOVAC2-PCT

(ix) TELECOMMUNICATION INFORMATION:
     (A) TELEPHONE: (514) 397-7400
     (B) TELEFAX: (514) 397-4382

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 3167 base pairs
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: double
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
     (A) ORGANISM: Streptococcus pneumoniae

(ix) FEATURE:
     (A) NAME/KEY: CDS
     (B) LOCATION: 30..755

(ix) FEATURE:
     (A) NAME/KEY: CDS
     (B) LOCATION: 771..2912
     (D) OTHER INFORMATION: /product= "FucI/HSP72 (C-169)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GAACTTCATT TTTAGAAAGG AGTGAGTTT ATG TCT CAA GAT GAA AAA TTA ATT  53
                                Met Ser Gln Asp Glu Lys Leu Ile
                                1               5

CGT GAA CAG ATT TGT GAT GTT TGT CAT AAG ATG TGG CAA CTT GGT TGG  101
Arg Glu Gln Ile Cys Asp Val Cys His Lys Met Trp Gln Leu Gly Trp
    10                  15                  20

[illegible] TTA GAT GAG GAT ACC  149
Val Ala Ala Asn Asp Gly Asn Val Ser Val Arg Leu Asp Glu Asp Thr
25                  30                  35                  40

ATT CTT GCA ACA CCT ACT GGT ATC AGC AAA AGT TTT ATT ACA CCA GAA  197
Ile Leu Ala Thr Pro Thr Gly Ile Ser Lys Ser Phe Ile Thr Pro Glu
                45                  50                  55

AAG CTG GTG AAG TTA AAT CTT AAA GGA GAG ATT TTA GAA GCA GAA GGT  245
Lys Leu Val Lys Leu Asn Leu Lys Gly Glu Ile Leu Glu Ala Glu Gly
            60                  65                  70

GAT TAC TGT CCT TCT AGT GAA ATT AAA ATG CAC ATT CGG TGC TAC GAA  293
Asp Tyr Cys Pro Ser Ser Glu Ile Lys Met His Ile Arg Cys Tyr Glu
        75                  80                  85

GAA CGT GAG GAT GTT CGT TCA GTT GTT CAC GCG CAT CCA CCG ATT GCA  341
Glu Arg Glu Asp Val Arg Ser Val Val His Ala His Pro Pro Ile Ala
    90                  95                  100

[illegible] TTA GAT ACT TAT TCA CTA ATT  389
Thr Gly Phe Ala Leu Ala His Ile Pro Leu Asp Thr Tyr Ser Leu Ile
                                        115                 120

GAG AGC GCG ATT GTG GTT GGG GCA ATT CCT ATT ACC CCA TTT GGA GTA  437
Glu Ser Ala Ile Val Val Gly Ala Ile Pro Ile Thr Pro Phe Gly Val
                125                 130                 135

CCG TCT ACA ATG GAA GTC CCA GAA GCA ATT ACA CCT TAT CTC CCC GAT  485
Pro Ser Thr Met Glu Val Pro Glu Ala Ile Thr Pro Tyr Leu Pro Asp
            140                 145                 150

CAT GAT GTC ATG CTA TTA GAA AAT CAT GGA GCT CTG ACT GTC GGA AGC  533
His Asp Val Met Leu Leu Glu Asn His Gly Ala Leu Thr Val Gly Ser
        155                 160                 165

[illegible] ACT TTA GAA TTA GTC GCA  581
Asp Val Ile Thr Ala Tyr Tyr Arg Met Glu Thr Leu Glu Leu Val Ala
                        175                 180

[illegible] TCT ACA AAG GGC ATT GAG  629
Lys Thr Thr Phe His Gly Arg Met Leu Leu Ser Thr Lys Gly Ile Glu
                    190                 195                 200

[illegible] CTA TTC TCA ATG CGA  677
Glu Gln Glu Ile Ala Arg Pro Thr Leu Glu Arg Leu Phe Ser Met Arg
                205                 210                 215

GAA AAT TAT AAG GTT ACA GGT CGT CAC CCA GGC TAC CGT AAA TAT AAT  725
Glu Asn Tyr Lys Val Thr Gly Arg His Pro Gly Tyr Arg Lys Tyr Asn
            220                 225                 230

GGC GAT GGT AGT ATA AAA GAA ACA AAA AAA TAAGAGGAAA GTATT ATG ATC  776
Gly Asp Gly Ser Ile Lys Glu Thr Lys Lys                  Met Ile
        235                 240                          1

CAA CAT CCA CGT ATT GGG ATT CGT CCG ACT ATT GAT GGT CGT CGT CAA  824
Gln His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Gly Arg Arg Gln
        5                   10                  15

GGT GTA CGC GAA TCA CTT GAA GTA CAA ACA ATG AAC ATG GCT AAA AGT  872
Gly Val Arg Glu Ser Leu Glu Val Gln Thr Met Asn Met Ala Lys Ser
    20                  25                  30


GTG GCA GAT TTG ATT TCA AGC ACA TTG AAA TAT CCA GAT GGG GAA CCT  920
Val Ala Asp Leu Ile Ser Ser Thr Leu Lys Tyr Pro Asp Gly Glu Pro
35                  40                  45                  50

GTG GAA TGT GTG ATT TCT CCA TCT ACC ATT GGT CGT GTT CCA GAG GCT  968
Val Glu Cys Val Ile Ser Pro Ser Thr Ile Gly Arg Val Pro Glu Ala
                55                  60                  65

GCA GCT TCC CAT GAG TTG TTT AAA AAA TCA AAT GTT TGC GCA ACA ATT  1016
Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala Thr Ile
            70                  75                  80

ACA GTT ACA CCA TGC TGG TGT TAT GGT AGT GAA ACT ATG GAT ATG TCT  1064
Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp Met Ser
        85                  90                  95

CCA GAT ATT CCT CAT GCT ATT TGG GGA TTT AAT GGG ACA GAA CGC CCA  1112
Pro Asp Ile Pro His Ala Ile Trp Gly Phe Asn Gly Thr Glu Arg Pro
    100                 105                 110



GGA GCT GTC TAT CTT GCA GCT GTA CTA GCT TCA CAT ACT CAA AAA GGG 
Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Thr Gln Lys Gly

115 120 125 130 

ATT CCA GCC TTT GGG ATT TAT GGT AGA GAT GTT CAG GAA GCT AAT GAT 
Ile Pro Ala Phe Gly Ile Tyr Gly Arg Asp Val Gln Glu Ala Asn Asp
135 140 145



1160 



1208 






AC--. GCT ATT CCA GAA GAT GTC AAA GAA AAA CTT TTA CGT TAT GCG CGG 
Thr Ala Ile Pro Glu Asp Val Lys Glu Lys Leu Leu Arg Tyr Ala Arg
150 155 160 

GCA GTT CTT GCA ACT GGC TTG ATG AGA GAC ACT GCT TAC CTA TCA ATG 
Ala Val Leu Ala Thr Gly Leu Met Arg Asp Thr Ala Tyr Leu Ser Met 
165 170 175 

GGT AGT GTT TCG ATG GGG ATT GGT GGT TCT ATT GTA AAT CCA GAT TTC 
Gly Ser Val Ser Met Gly Ile Gly Gly Ser Ile Val Asn Pro Asp Phe
180 185 190 

TTC CAA GAA TAC TTA GGA ATG CGA AAT GAA TCG GTA GAT ATG ACG GAG 
Phe Gln Glu Tyr Leu Gly Met Arg Asn Glu Ser Val Asp Met Thr Glu
195 200 205 210 

TTC ACG CGC CGT ATG GAC CGT GGT ATT TAC GAC CCT GAA GAG TTC GAA 
Phe Thr Arg Arg Met Asp Arg Gly Ile Tyr Asp Pro Glu Glu Phe Glu
215 220 225 

CGT GCG CTC AAA TGG GTG AAA GAA AAC GTA AAA GAA GGA TTC GAC CAT 
Arg Ala Leu Lys Trp Val Lys Glu Asn Val Lys Glu Gly Phe Asp His 
230 235 240 

AAC CGT GAA GAC CTT GTT TTA AGC CGT GAA GAA AAA GAT AGA CAA TGG 
Asn Arg Glu Asp Leu Val Leu Ser Arg Glu Glu Lys Asp Arg Gln Trp
245 250 255 



1256 



1304 



1352 



1400 



1448 



1496 



1544

GAA TTT GTT ATT AAG ATG TTC ATG ATT GGA CGT GAC TTA ATG GTT GGT 1592
Glu Phe Val Ile Lys Met Phe Met Ile Gly Arg Asp Leu Met Val Gly
260 265 270

AAC CCA AGA CTT GCT GAA CTT GGT TTT GAG GAA GAA GCA GTT GGT CAC 1640 
Asn Pro Arg Leu Ala Glu Leu Gly Phe Glu Glu Glu Ala Val Gly His 
275 280 285 290 

CAT GCT TTA GTA GCT GGT TTC CAA GGT CAA CGT CAG TGG ACA GAC CAT 1688
His Ala Leu Val Ala Gly Phe Gln Gly Gln Arg Gln Trp Thr Asp His
295 300 305 

TTT CCA AAT GGG GAC TTT ATG GAA ACT TTC CTC AAT ACT CAG TTT GAC 1736
Phe Pro Asn Gly Asp Phe Met Glu Thr Phe Leu Asn Thr Gln Phe Asp
310 315 320

TGG AAT GGT ATT CGA AAA CCA TTT GTA TTT GCG ACA GAG AAT GAT TCA 1784 
Trp Asn Gly Ile Arg Lys Pro Phe Val Phe Ala Thr Glu Asn Asp Ser
325 330 335

CTA AAT GGT GTG TCT ATG CTC TTT AAT TAT CTA TTA ACA AAT ACT CCA 1832 

Leu Asn Gly Val Ser Met Leu Phe Asn Tyr Leu Leu Thr Asn Thr Pro 

340 345 350 


CAA ATC TTT GCT GAT GTG CGT ACT TAT TGG AGT CCA GAG GCT GTT GAA 1880 

Gln Ile Phe Ala Asp Val Arg Thr Tyr Trp Ser Pro Glu Ala Val Glu
355 360 365 370 

CGT GTA ACA GGA TAT ACT TTA GAG GGT CGT GCT GCA GCT GGA TTC TTA 1928
Arg Val Thr Gly Tyr Thr Leu Glu Gly Arg Ala Ala Ala Gly Phe Leu 
375 380 385 

CAT CTA ATC AAC TCT GGA TCT TGT ACA TTG GAT GGT ACA GGT CAA GCT 1976 
His Leu Ile Asn Ser Gly Ser Cys Thr Leu Asp Gly Thr Gly Gln Ala

390 395 400 

ACT CGA GAT GGC AAA CCT GTT ATG AAA CCA TTC TGG GAG TTG GAT GAA 2024 
Thr Arg Asp Gly Lys Pro Val Met Lys Pro Phe Trp Glu Leu Asp Glu 
405 410 415

AGT GAA GTA CAG GCT ATG CTT GAA AAT ACA GAC TTC CCA CCA GCA AAC 2072 

Ser Glu Val Gln Ala Met Leu Glu Asn Thr Asp Phe Pro Pro Ala Asn
420 425 430

CGC GAA TAC TTC CGT GGA GGA GGA TTC TCA ACT CGT TTC TTG ACG AAG 2120 

Arg Glu Tyr Phe Arg Gly Gly Gly Phe Ser Thr Arg Phe Leu Thr Lys 
435 440 445 450 

GGG GAT ATG CCA GTA ACA ATG GTA CGT CTC AAT CTT TTA AAA GGG GTT 2168
Gly Asp Met Pro Val Thr Met Val Arg Leu Asn Leu Leu Lys Gly Val 
455 460 465 

GGT CCA GTG CTA CAA ATT GCA GAA GGT TAC ACA CTT GAA CTT CCT GAA 2216 
Gly Pro Val Leu Gln Ile Ala Glu Gly Tyr Thr Leu Glu Leu Pro Glu
470 475 480 

GAT GTT CAC CAT ACT TTA GAT AAT CGT ACA GAT CCA GGA TGG CCA ACT 2264 
Asp Val His His Thr Leu Asp Asn Arg Thr Asp Pro Gly Trp Pro Thr 
485 490 495

ACT TGG TTT GCT CCA CGT TTG ACA GGA PJ^ GGT GCT TTC AAG TCT GTC 2312 
Thr Trp Phe Ala Pro Arg Leu Thr Gly Lys Gly Ala Phe Lys Ser Val 
500 505 510 






TAT GAC GTC ATG AAT AAT TGG GGA GCT AAT CAC GGA GCC ATA ACA TAT 2360 
Tyr Asp Val Met Asn Asn Trp Gly Ala Asn His Gly Ala Ile Thr Tyr
515 520 525 530



GGA CAC ATT GGA GCA GAC TTG ATT ACC TTG GCT TCT ATG TTG AGA ATT 2408
Gly His Ile Gly Ala Asp Leu Ile Thr Leu Ala Ser Met Leu Arg Ile
535 540 545

CCT CAA ATC GAA GTA ACA TTT GAC ATC GAC AAG AAC GGT ATC GTG TCT 2456 
Pro Gln Ile Glu Val Thr Phe Asp Ile Asp Lys Asn Gly Ile Val Ser
550 555 560 

GTT AAG GCC AAA GAC CTT GGA ACT CAA AAA GAA CAA ACT ATT GTC ATC 2504 
Val Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
565 570 575 

CAA TCG AAC TCA GGT TTG ACT GAC GAA GAA ATC GAC CGC ATG ATG AAA 2552 
Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys
580 585 590 

GAT GCA GAA GCA AAC GCT GAA TCC GAT AAG AAA CGT AAA GAA GAA GTA 2600 
Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val 
595 600 605 610 

GAC CTT CGT AAT GAA GTG GAC CAA GCA ATC TTT GCG ACT GAA AAG ACA 2648
Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr
615 620 625

ATC AAG GAA ACT GAA GGT AAA GGC TTC GAC GCA GAA CGT GAC GCT GCC 2696 
Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala
630 635 640 

CAA GCT GCC CTT GAT GAC CTT AAG AAA GCT CAA GAA GAC AAC AAC TTG 2744 
Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu
645 650 655 

GAC GAC ATG AAA GCA AAA CTT GAA GCA TTG AAC GAA AAA GCT CAA GGA 2792 
Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly

660 665 670 

CTT GCT GTT AAA CTC TAC GAA CAA GCC GCA GCA GCG CAA CAA GCT CAA 2840
Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln
675 680 685 690 

GAA GGA GCA GAA GGC GCA CAA GCA ACA GGA AAC GCA GGC GAT GAC GTC 2888 
Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val
695 700 705 

GTA GAC GGA GAG TTT ACG GAA AAG TAAGATGAGT GTATTGGATG AAGAGTATCT 2942 
Val Asp Gly Glu Phe Thr Glu Lys 
710 

AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA GCTGATAACT ATAGAACATC 3002

AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT AGATATAGAG AAATTATATT 3062

AGCTGAGCAT GATAGTTGTG TCAAAAATGA TGAAGCGGTA AGGAATTTTC TTACCTCAGT 3122

ATTGTTGTCT GCATTTGTAT CGGCGATGGT ATCAGCTATG ATATC 3167






(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ser Gln Asp Glu Lys Leu Ile Arg Glu Gln Ile Cys Asp Val Cys
1 5 10 15

His Lys Met Trp Gln Leu Gly Trp Val Ala Ala Asn Asp Gly Asn Val
20 25 30 

Ser Val Arg Leu Asp Glu Asp Thr Ile Leu Ala Thr Pro Thr Gly Ile
35 40 45 

Ser Lys Ser Phe Ile Thr Pro Glu Lys Leu Val Lys Leu Asn Leu Lys
50 55 60

Gly Glu Ile Leu Glu Ala Glu Gly Asp Tyr Cys Pro Ser Ser Glu Ile
65 70 75 80 

Lys Met His Ile Arg Cys Tyr Glu Glu Arg Glu Asp Val Arg Ser Val
85 90 95

Val His Ala His Pro Pro Ile Ala Thr Gly Phe Ala Leu Ala His Ile
100 105 110

Pro Leu Asp Thr Tyr Ser Leu Ile Glu Ser Ala Ile Val Val Gly Ala
115 120 125 

Ile Pro Ile Thr Pro Phe Gly Val Pro Ser Thr Met Glu Val Pro Glu
130 135 140

Ala Ile Thr Pro Tyr Leu Pro Asp His Asp Val Met Leu Leu Glu Asn
145 150 155 160

His Gly Ala Leu Thr Val Gly Ser Asp Val Ile Thr Ala Tyr Tyr Arg
165 170 175

Met Glu Thr Leu Glu Leu Val Ala Lys Thr Thr Phe His Gly Arg Met
180 185 190

Leu Leu Ser Thr Lys Gly Ile Glu Glu Gln Glu Ile Ala Arg Pro Thr
195 200 205

Leu Glu Arg Leu Phe Ser Met Arg Glu Asn Tyr Lys Val Thr Gly Arg 
210 215 220 

His Pro Gly Tyr Arg Lys Tyr Asn Gly Asp Gly Ser Ile Lys Glu Thr
225 230 235 240

Lys Lys 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 714 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Ile Gln His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Gly Arg
1 5 10 15

Arg Gln Gly Val Arg Glu Ser Leu Glu Val Gln Thr Met Asn Met Ala

20 25 30 

Lys Ser Val Ala Asp Leu Ile Ser Ser Thr Leu Lys Tyr Pro Asp Gly
35 40 45

Glu Pro Val Glu Cys Val Ile Ser Pro Ser Thr Ile Gly Arg Val Pro
50 55 60

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala
65 70 75 80

Thr Ile Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp

85 90 95 

Met Ser Pro Asp Ile Pro His Ala Ile Trp Gly Phe Asn Gly Thr Glu
100 105 110

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Thr Gln
115 120 125 

Lys Gly Ile Pro Ala Phe Gly Ile Tyr Gly Arg Asp Val Gln Glu Ala
130 135 140

Asn Asp Thr Ala Ile Pro Glu Asp Val Lys Glu Lys Leu Leu Arg Tyr
145 150 155 160

Ala Arg Ala Val Leu Ala Thr Gly Leu Met Arg Asp Thr Ala Tyr Leu

165 170 175 

Ser Met Gly Ser Val Ser Met Gly Ile Gly Gly Ser Ile Val Asn Pro
180 185 190

Asp Phe Phe Gln Glu Tyr Leu Gly Met Arg Asn Glu Ser Val Asp Met
195 200 205

Thr Glu Phe Thr Arg Arg Met Asp Arg Gly Ile Tyr Asp Pro Glu Glu
210 215 220

Phe Glu Arg Ala Leu Lys Trp Val Lys Glu Asn Val Lys Glu Gly Phe
225 230 235 240

Asp His Asn Arg Glu Asp Leu Val Leu Ser Arg Glu Glu Lys Asp Arg

245 250 255 

Gln Trp Glu Phe Val Ile Lys Met Phe Met Ile Gly Arg Asp Leu Met
260 265 270

Val Gly Asn Pro Arg Leu Ala Glu Leu Gly Phe Glu Glu Glu Ala Val
275 280 285

Gly His His Ala Leu Val Ala Gly Phe Gln Gly Gln Arg Gln Trp Thr
290 295 300

Asp His Phe Pro Asn Gly Asp Phe Met Glu Thr Phe Leu Asn Thr Gln
305 310 315 320

Phe Asp Trp Asn Gly Ile Arg Lys Pro Phe Val Phe Ala Thr Glu Asn

325 330 335 

Asp Ser Leu Asn Gly Val Ser Met Leu Phe Asn Tyr Leu Leu Thr Asn 
340 345 350 

Thr Pro Gln Ile Phe Ala Asp Val Arg Thr Tyr Trp Ser Pro Glu Ala
355 360 365

Val Glu Arg Val Thr Gly Tyr Thr Leu Glu Gly Arg Ala Ala Ala Gly
370 375 380

Phe Leu His Leu Ile Asn Ser Gly Ser Cys Thr Leu Asp Gly Thr Gly
385 390 395 400

Gln Ala Thr Arg Asp Gly Lys Pro Val Met Lys Pro Phe Trp Glu Leu

405 410 415 

Asp Glu Ser Glu Val Gln Ala Met Leu Glu Asn Thr Asp Phe Pro Pro
420 425 430

Ala Asn Arg Glu Tyr Phe Arg Gly Gly Gly Phe Ser Thr Arg Phe Leu
435 440 445 

Thr Lys Gly Asp Met Pro Val Thr Met Val Arg Leu Asn Leu Leu Lys 
450 455 460 

Gly Val Gly Pro Val Leu Gln Ile Ala Glu Gly Tyr Thr Leu Glu Leu
465 470 475 480 

Pro Glu Asp Val His His Thr Leu Asp Asn Arg Thr Asp Pro Gly Trp 

485 490 495

Pro Thr Thr Trp Phe Ala Pro Arg Leu Thr Gly Lys Gly Ala Phe Lys
500 505 510 

Ser Val Tyr Asp Val Met Asn Asn Trp Gly Ala Asn His Gly Ala Ile
515 520 525

Thr Tyr Gly His Ile Gly Ala Asp Leu Ile Thr Leu Ala Ser Met Leu
530 535 540

Arg Ile Pro Gln Ile Glu Val Thr Phe Asp Ile Asp Lys Asn Gly Ile
545 550 555 560

Val Ser Val Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile
565 570 575

Val Ile Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met

580 585 590 

Met Lys Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu 
595 600 605 

Glu Val Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu
610 615 620

Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp
625 630 635 640

Ala Ala Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn
645 650 655

Asn Leu Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala
660 665 670

Gln Gly Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln
675 680 685

Ala Gln Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp
690 695 700 

Asp Val Val Asp Gly Glu Phe Thr Glu Lys 
705 710 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Streptococcus pneumoniae

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 682..2502

(D) OTHER INFORMATION: /product= "Heat-shock protein 72"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3265..4320

(D) OTHER INFORMATION: /product= "NH2-terminal portion of DNA J"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 682..2502
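
The CDS coordinates in the feature table above can be cross-checked against the protein lengths stated for SEQ ID NO: 5 (607 amino acids) and SEQ ID NO: 6 (352 amino acids). The following is a minimal sketch only, assuming 1-based inclusive coordinates that exclude the stop codon; the function name `cds_codon_count` is hypothetical and is not part of the listing.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in a CDS.

    Assumes 1-based, inclusive coordinates, as in the LOCATION fields above.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS {start}..{end} is not a whole number of codons")
    return length // 3


# Coordinates copied from the feature table of SEQ ID NO: 4.
features = {
    "Heat-shock protein 72": (682, 2502),
    "NH2-terminal portion of DNA J": (3265, 4320),
}

for name, (start, end) in features.items():
    print(f"{name}: {cds_codon_count(start, end)} codons")
```

Under these assumptions the two features give 607 and 352 codons, which agrees with the amino acid lengths given for SEQ ID NO: 5 and SEQ ID NO: 6.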

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AAGCTTGATT CACGCTTTGA AAGAAGAAGG AATTGAAGAA ATCGCAGCAG ATCGCGAATT 60

TGACCATAAC TACCATATGG CCATCCAAAC TCTCCCAGCA GACGATGAAC ACCCAGTAGA 120 

TACCATCGCC CAAGTCTTTC AAAAAGGCTA CAAACTCCAT GACCGCATCC TACGCCCAGC 180 

AATGGTAGTG GTGTATAACT AAGATACAAA GCCCGTAAAA AGCTCGCAGT AAAAATAGGA 240 

GATTGACGAA GTGTTCGATG AACACAAGAA AATCTATCTT TTTTACTCAG AGCTTAGGGC 300 

GTGTTCGATT CGGCAATTCT GACGGTAGCT AAAGCAACTC GTCAGAAAAC GGCAGTCGCT 360

ATGGCGTTTG TCTAGCTTCC TTACTAACTC GTCGTCGAAA TAAAATCGAT TTCGACTCTT 420 

CGTGTCGCAA TTTACATAAT AGAAAACTTG TCCGAAACGA CAATAAACTA TGAAGAAAGA 480 

TAAAATATGT TTGGCTTTGT AATAGTGAGC GAAGCGAACC AAAGACGATA CTCTTCGCTG 540 

TGGCGCTATT TGCGCAAATT TTGAGACCTT AGGCTCAAAG TTTAGTCAAA GAGATTGACA 600 

AAGTCAAGCT CTGACGGCGT CGCCACTTAA GAAGAGTATC AAAAAGAAAA ATAGAAAATT 660



AACTAACAAG GAGAAAAACA C ATG TCT AAA ATT ATC GGT ATT GAG TTA GGT 711
Met Ser Lys Ile Ile Gly Ile Asp Leu Gly
1 5 10

ACA ACA AAC TCA GCA GTT GCA GTT CTT GAA GGA ACT GAA AGC AAA ATC 759 
Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser Lys Ile
15 20 25 



ATC GCA AAC CCA GAA GGA AAC CGC ACA ACT CCA TCT GTA GTC TCA TTC 807
Ile Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val Ser Phe
30 35 40



AAA AAC GGA GAA ATC ATC GTT GGT GAT GCT GCA AAA CGT CAA GCA GTT 855 
Lys Asn Gly Glu Ile Ile Val Gly Asp Ala Ala Lys Arg Gln Ala Val
45 50 55 

ACA AAC CCA GAT ACA GTT ATC TCT ATC AAA TCT AAG ATG GGA ACT TCT 903 
Thr Asn Pro Asp Thr Val Ile Ser Ile Lys Ser Lys Met Gly Thr Ser
60 65 70

GAA AAA GTT TCT GCA AAT GGA AAA GAA TAC ACT CCA CAA GAA ATC TCA 951 
Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gln Glu Ile Ser
75 80 85 90 



GCT ATG ATC CTT CAA TAC TTG AAA GGC TAC GCT GAA GAC TAC CTT GGT 999 
Ala Met Ile Leu Gln Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly
95 100 105

GAG AAA GTA ACC AAA GCT GTT ATC ACA GTT CCG GCT TAC TTC AAC GAC 1047
Glu Lys Val Thr Lys Ala Val Ile Thr Val Pro Ala Tyr Phe Asn Asp
110 115 120

GCT CAA CGT CAA GCA ACA AAA GAC GCT GGT AAA ATT GCT GGT CTT GAA 1095 
Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu
125 130 135 

GTA GAA CGT ATT GTT AAC GAA CCA ACT GCA GCA GCT CTT GCT TAT GGT 1143
Val Glu Arg Ile Val Asn Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly
140 145 150 

TTG GAC AAG ACT GAC AAA GAA GAA AAA ATC TTG GTA TTT GAC CTT GGT 1191 
Leu Asp Lys Thr Asp Lys Glu Glu Lys Ile Leu Val Phe Asp Leu Gly
155 160 165 170 

GGT GGT ACA TTC GAC GTC TCT ATC CTT GAA TTG GGT GAC GGT GTC TTC 1239 
Gly Gly Thr Phe Asp Val Ser Ile Leu Glu Leu Gly Asp Gly Val Phe
175 180 185

GAC GTA TTG TCA ACT GCA GGG GAC AAC AAA CTT GGT GGT GAC GAC TTT 1287 
Asp Val Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe 
190 195 200 

GAC CAA AAA ATC ATT GAC CAC TTG GTA GCA GAA TTC AAG AAA GAA AAC 1335
Asp Gln Lys Ile Ile Asp His Leu Val Ala Glu Phe Lys Lys Glu Asn
205 210 215 

GGT ATC GAC TTG TCT ACT GAC AAG ATG GCA ATG CAA CGT TTG AAA GAT 1383
Gly Ile Asp Leu Ser Thr Asp Lys Met Ala Met Gln Arg Leu Lys Asp
220 225 230 

GCG GCT GAA AAA GCG AAG AAA GAC CTT TCT GGT GTA ACT TCA ACA CAA 1431 
Ala Ala Glu Lys Ala Lys Lys Asp Leu Ser Gly Val Thr Ser Thr Gln
235 240 245 250 

ATC AGC TTG CCA TTT ATC ACT GCA GGT GAG GCT GGA CCT CTT CAC TTG 1479 
Ile Ser Leu Pro Phe Ile Thr Ala Gly Glu Ala Gly Pro Leu His Leu
255 260 265

GAA ATG ACT TTA ACT CGT GCG AAA TTT GAT GAT TTG ACT CGT GAC CTT 1527
Glu Met Thr Leu Thr Arg Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu
270 275 280

GTT GAA CGT ACA AAA GTT CCA GTT CGT CAA GCC CTT TCA GAT GCA GGT 1575
Val Glu Arg Thr Lys Val Pro Val Arg Gln Ala Leu Ser Asp Ala Gly
285 290 295

TTG AGC TTG TCA GAA ATC GAC GAA GTT ATC CTT GTT GGT GGT TCA ACT 1623
Leu Ser Leu Ser Glu Ile Asp Glu Val Ile Leu Val Gly Gly Ser Thr
300 305 310 

CGT ATC CCT GCC GTT GTT GAA GCT GTT AAA GCT GAA ACT GGT AAA GAA 1671 
Arg Ile Pro Ala Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys Glu
315 320 325 330 

CCA AAC AAA TCA GTA AAC CCT GAT GAA GTA GTT GCT ATC GGT GCG GCT 1719 
Pro Asn Lys Ser Val Asn Pro Asp Glu Val Val Ala Met Gly Ala Ala 
335 340 345

ATC CAA GGT GGT GTG ATT ACT GGT GAT GTC AAG GAT GTT GTC CTT CTT 1767 
Ile Gln Gly Gly Val Ile Thr Gly Asp Val Lys Asp Val Val Leu Leu
350 355 360

GAT GTA ACG CCA TTG TCA CTT GGT ATC GAA ACA ATG GGT GGA GTA TTT 1815
Asp Val Thr Pro Leu Ser Leu Gly Ile Glu Thr Met Gly Gly Val Phe
365 370 375

WO 96/40928 PCT/CA96/00322

ACA AAA CTT ATC GAT CGC AAC ACT ACA ATC CCA ACA TCT AAA TCA CAA 1863 
Thr Lys Leu Ile Asp Arg Asn Thr Thr Ile Pro Thr Ser Lys Ser Gln
380 385 390 

GTC TTC TCA ACA GCA GCA GAC AAC CAA CCA GCC GTT GAT ATC CAC GTT 1911
Val Phe Ser Thr Ala Ala Asp Asn Gln Pro Ala Val Asp Ile His Val
395 400 405 410 

CTT CAA GGT GAA CGC CCA ATG GCA GCA GAT AAC AAG ACT CTT GGA CGC 1959
Leu Gln Gly Glu Arg Pro Met Ala Ala Asp Asn Lys Thr Leu Gly Arg
415 420 425 



TTC CAA TTG ACT GAT ATC CCA GCT GCA CCT CGT GGA ATT CCT CAA ATC 2007
Phe Gln Leu Thr Asp Ile Pro Ala Ala Pro Arg Gly Ile Pro Gln Ile
430 435 440



GAA GTA ACA TTT GAC ATC GAC AAG AAC GGT ATC GTG TCT GTT AAG GCC 2055 
Glu Val Thr Phe Asp Ile Asp Lys Asn Gly Ile Val Ser Val Lys Ala
445 450 455



AAA GAC CTT GGA ACT CAA AAA GAA CAA ACT ATT GTC ATC CAA TCG AAC 2103 
Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile Gln Ser Asn

460 465 470 

TCA GGT TTG ACT GAC GAA GAA ATC GAC CGC ATG ATG AAA GAT GCA GAA 2151 
Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys Asp Ala Glu
475 480 485 490 

GCA AAC GCT GAA TCC GAT AAG AAA CGT AAA GAA GAA GTA GAC CTT CGT 2199 
Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val Asp Leu Arg 
495 500 505

AAT GAA GTG GAC CAA GCA ATC TTT GCG ACT GAA AAG ACA ATC AAG GAA 2247 
Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu

510 515 520 



ACT GAA GGT AAA GGC TTC GAC GCA GAA CGT GAC GCT GCC CAA GCT GCC 2295
Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala Gln Ala Ala
525 530 535



CTT GAT GAC CTT AAG AAA GCT CAA GAA GAC AAC AAC TTG GAC GAC ATG 2343 

Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu Asp Asp Met
540 545 550 

AAA GCA AAA CTT GAA GCA TTG AAC GAA AAA GCT CAA GGA CTT GCT GTT 2391 

Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly Leu Ala Val
555 560 565 570 

AAA CTC TAC GAA CAA GCC GCA GCA GCG CAA CAA GCT CAA GAA GGA GCA 2439
Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln Glu Gly Ala
575 580 585 



GAA GGC GCA CAA GCA ACA GGA AAC GCA GGC GAT GAC GTC GTA GAC GGA 2487 
Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val Val Asp Gly
590 595 600 



GAG TTT ACG GAA AAG TAAGATGAGT GTATTGGATG AAGAGTATCT AAAAAATACA 2542 
Glu Phe Thr Glu Lys 
605



CGAAAAGTTT ATAATGATTT TTGTAATCAA GCTGATAACT ATAGAACATC AAAAGATTTT 2602

ATTGATAATA TTCCAATAGA ATATTTAGCT AGATATAGAG AAATTATATT AGCTGAGCAT 2662

GATAGTTGTG TCAAAAATGA TGAAGCGGTA AGGAATTTTG TTACCTCAGT ATTGTTGTCT 2722

GCATTTGTAT CGGCGATGGT ATCAGCTATG ATATCATTAG AAATACAAAC ATATAAATTT 2782

GTAATACCGT TCATAATTGG TATGATTTGG ACAGTAGTTG TATTTCTTAT GATCAATTGG 2842

AATTATATAG GCAAATACTA AGAAGAGACA AAT^TATATA AATATTTCTC TACTTATAGG 2902

ATATTTAAAA TCCAAATAAA GTTAATTTAC TTATTTGCAG AGGTTGCAAC CCAGCCTCTG 2962

TTTTTCGATA AAAAGGGACG GAATCTCATT TGTTTGGGTT TTGTCTCATC AATAGAAAGG 3022

AACAAAGAGT GTTCGTAACT GAACACGGGT TTCAGAATTT CTTACTAAAT ATAAAAGAAA 3082

GGAATTGAAC CCGACCTAAA TGGTGGTTCG ATTCAGAACA TCAATAGAAA GGAATAAGGG 3142

TGTTCGTAAC TGAACACGGG CTACGGACTG TGCCAAAAAG ATAGTTTTTT CTAGGACGTA 3202

AGCGTCCGTC GTCAAAACTC CTAGATGGCT GTGTCCGTTT GACGCCCTTT GTATCTTGAA 3262


TT ATG AAC AAT ACT GAA TTT TAT GAT CGT CTG GGG GTA TCC AAA AAC 3309
Met Asn Asn Thr Glu Phe Tyr Asp Arg Leu Gly Val Ser Lys Asn
1 5 10 15



GCT TCG GCA GAC GAA ATC AAA AAG GCT TAT CGT AAG CTT TCC AAA AAA 3357 
Ala Ser Ala Asp Glu Ile Lys Lys Ala Tyr Arg Lys Leu Ser Lys Lys

20 25 30 

TAT CAC CCA GAT ATC AAC AAG GAG CCT GGT GCT GAG GAC AAG TAC AAG 3405
Tyr His Pro Asp Ile Asn Lys Glu Pro Gly Ala Glu Asp Lys Tyr Lys
35 40 45

GAA GTT CAA GAA GCC TAT GAG ACT TTG ACT GAC GAC CAA AAA CGT GCT 3453 
Glu Val Gln Glu Ala Tyr Glu Thr Leu Ser Asp Asp Gln Lys Arg Ala
50 55 60 



GCC TAT GAC CAG TAT GGT GCT GCA GGC GCC AAT GGT GGT TTT GGT GGA 3501 
Ala Tyr Asp Gln Tyr Gly Ala Ala Gly Ala Asn Gly Gly Phe Gly Gly
65 70 75 



GCT GGT GGT TTC GGC GGT TTC AAT GGG GCA GGT GGC TTC GGT GGT TTT 3549
Ala Gly Gly Phe Gly Gly Phe Asn Gly Ala Gly Gly Phe Gly Gly Phe 
80 85 90 95 

GAG GAT ATT TTC TCA AGT TTC TTC GGC GGA GGC GGT TCT TCG CGC AAT 3597
Glu Asp Ile Phe Ser Ser Phe Phe Gly Gly Gly Gly Ser Ser Arg Asn
100 105 110

CCA AAC GCT CCT CGC CAA GGA GAT GAT CTC CAG TAT CGT GTC AAT TTG 3645
Pro Asn Ala Pro Arg Gln Gly Asp Asp Leu Gln Tyr Arg Val Asn Leu
115 120 125

ACC TTT GAA GAA GCT ATC TTC GGA ACT GAG AAG GAA GTT AAG TAT CAT 3693 
Thr Phe Glu Glu Ala Ile Phe Gly Thr Glu Lys Glu Val Lys Tyr His
130 135 140 



CGT GAA GCT GGC TGT CGT ACA TGT AAT GGA TCT GGT GCT AAG CCA GGG 3741 
Arg Glu Ala Gly Cys Arg Thr Cys Asn Gly Ser Gly Ala Lys Pro Gly 
145 150 155 



ACA AGT CCA GTC ACT TGT GGA CGC TGT CAT GGC GCT GGT GTC ATT AAC 3789
Thr Ser Pro Val Thr Cys Gly Arg Cys His Gly Ala Gly Val Ile Asn
160 165 170 175 

GTC GAT ACG CAG ACT CCT CTT GGT ATG ATG CGT CGC CAA GTA ACC TGT 3837 
Val Asp Thr Gln Thr Pro Leu Gly Met Met Arg Arg Gln Val Thr Cys

180 185 190 

GAT GTC TGT CAC GGT CGA GGA AAA GAA ATC AAA TAT CCA TGT ACA ACC 3885
Asp Val Cys His Gly Arg Gly Lys Glu Ile Lys Tyr Pro Cys Thr Thr
195 200 205

TGT CAT GGA ACA GGT CAT GAG AAA CAA GCT CAT AGC GTA CAT GTG AAA 3933 
Cys His Gly Thr Gly His Glu Lys Gln Ala His Ser Val His Val Lys
210 215 220

ATC CCT GCT GGT GTG GAA ACA GGT CAA CAA ATT CGC CTC GCT GGT CAA 3981
Ile Pro Ala Gly Val Glu Thr Gly Gln Gln Ile Arg Leu Ala Gly Gln
225 230 235

GGT GAA GCA GGC TTT AAC GGT GGA CCT TAT GGT GAC TTG TAT GTA GTA 4029 
Gly Glu Ala Gly Phe Asn Gly Gly Pro Tyr Gly Asp Leu Tyr Val Val 
240 245 250 255 

GTT TCT GTG GAA GCT AGT GAC AAG TTT GAA CGT GAA GGA ACG ACT ATC 4077
Val Ser Val Glu Ala Ser Asp Lys Phe Glu Arg Glu Gly Thr Thr Ile
260 265 270 

TTC TAC AAT CTC AAC CTC AAC TTT GTC CAA GCG GCT CTT GGT GAT ACA 4125 
Phe Tyr Asn Leu Asn Leu Asn Phe Val Gln Ala Ala Leu Gly Asp Thr

275 280 285 

GTA GAT ATT CCA ACT GTT CAC GGT GAT GTT GAA TTG GTT ATT CCA GAG 4173 
Val Asp Ile Pro Thr Val His Gly Asp Val Glu Leu Val Ile Pro Glu
290 295 300



GGA ACT CAG ACT GGT AAG AAA TTC CGC CTA CGT AGT AAG GGG GCA CCG 4221 
Gly Thr Gln Thr Gly Lys Lys Phe Arg Leu Arg Ser Lys Gly Ala Pro
305 310 315 

AGC CTT CGT GGC GGT GCA GTT GGT GAC CAA TAC GTT ACT GTT AAT GTC 4269 
Ser Leu Arg Gly Gly Ala Val Gly Asp Gln Tyr Val Thr Val Asn Val
320 325 330 335 

GTA ACA CCG ACA GGC TTG AAC GAC CGC CAA AAA GTA GCC TTG AAA GAA 4317
Val Thr Pro Thr Gly Leu Asn Asp Arg Gln Lys Val Ala Leu Lys Glu
340 345 350 



TTC 4320
Phe
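
Because each nucleotide row in SEQ ID NO: 4 is paired with its derived amino acid row, individual rows can be spot-checked by translating the codons with the standard genetic code. The sketch below is illustrative only: the helper name `translate` and the use of "Ter" for stop codons are assumptions, and the standard code is assumed to apply to these streptococcal genes.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter codes, matching the notation used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into three-letter residues."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return " ".join(THREE_LETTER[CODON_TABLE[codon]] for codon in codons)


# Row ending at nucleotide 759 of SEQ ID NO: 4 (residues 11-26 of HSP72).
row = "ACA ACA AAC TCA GCA GTT GCA GTT CTT GAA GGA ACT GAA AGC AAA ATC"
print(translate(row))
```

For this row the translation reproduces the amino acid line printed beneath it in the listing (Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser Lys Ile). Rows where the two lines disagree are candidates for transcription or scanning errors in one of them.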



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Ala Val
1 5 10 15

Ala Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro Glu Gly

20 25 30 

Asn Arg Thr Thr Pro Ser Val Val Ser Phe Lys Asn Gly Glu Ile Ile

35 40 45 



Val Gly Asp Ala Ala Lys Arg Gln Ala Val Thr Asn Pro Asp Thr Val
50 55 60

Ile Ser Ile Lys Ser Lys Met Gly Thr Ser Glu Lys Val Ser Ala Asn
65 70 75 80

Gly Lys Glu Tyr Thr Pro Gln Glu Ile Ser Ala Met Ile Leu Gln Tyr

85 90 95 

Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly Glu Lys Val Thr Lys Ala 
100 105 110

Val Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gln Arg Gln Ala Thr
115 120 125

Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu Val Glu Arg Ile Val Asn
130 135 140

Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly Leu Asp Lys Thr Asp Lys
145 150 155 160

Glu Glu Lys Ile Leu Val Phe Asp Leu Gly Gly Gly Thr Phe Asp Val
165 170 175

Ser Ile Leu Glu Leu Gly Asp Gly Val Phe Asp Val Leu Ser Thr Ala
180 185 190

Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe Asp Gln Lys Ile Ile Asp
195 200 205

His Leu Val Ala Glu Phe Lys Lys Glu Asn Gly Ile Asp Leu Ser Thr
210 215 220 

Asp Lys Met Ala Met Gln Arg Leu Lys Asp Ala Ala Glu Lys Ala Lys

225 230 235 240 

Lys Asp Leu Ser Gly Val Thr Ser Thr Gln Ile Ser Leu Pro Phe Ile
245 250 255 

Thr Ala Gly Glu Ala Gly Pro Leu His Leu Glu Met Thr Leu Thr Arg 

260 265 270 

Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Val 
275 280 285 

Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu Ser Leu Ser Glu Ile

290 295 300 



Asp Glu Val Ile Leu Val Gly Gly Ser Thr Arg Ile Pro Ala Val Val
305 310 315 320 

Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn Lys Ser Val Asn 

325 330 335 



Pro Asp Glu Val Val Ala Met Gly Ala Ala Ile Gln Gly Gly Val Ile
340 345 350 

Thr Gly Asp Val Lys Asp Val Val Leu Leu Asp Val Thr Pro Leu Ser 
355 360 365 

Leu Gly Ile Glu Thr Met Gly Gly Val Phe Thr Lys Leu Ile Asp Arg
370 375 380

Asn Thr Thr Ile Pro Thr Ser Lys Ser Gln Val Phe Ser Thr Ala Ala
385 390 395 400

Asp Asn Gln Pro Ala Val Asp Ile His Val Leu Gln Gly Glu Arg Pro
405 410 415

Met Ala Ala Asp Asn Lys Thr Leu Gly Arg Phe Gln Leu Thr Asp Ile
420 425 430

Pro Ala Ala Pro Arg Gly Ile Pro Gln Ile Glu Val Thr Phe Asp Ile

435 440 445 

Asp Lys Asn Gly Ile Val Ser Val Lys Ala Lys Asp Leu Gly Thr Gln
450 455 460

Lys Glu Gln Thr Ile Val Ile Gln Ser Asn Ser Gly Leu Thr Asp Glu
465 470 475 480

Glu Ile Asp Arg Met Met Lys Asp Ala Glu Ala Asn Ala Glu Ser Asp
485 490 495

Lys Lys Arg Lys Glu Glu Val Asp Leu Arg Asn Glu Val Asp Gln Ala
500 505 510

Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe
515 520 525

Asp Ala Glu Arg Asp Ala Ala Gln Ala Ala Leu Asp Asp Leu Lys Lys
530 535 540

Ala Gln Glu Asp Asn Asn Leu Asp Asp Met Lys Ala Lys Leu Glu Ala
545 550 555 560

Leu Asn Glu Lys Ala Gln Gly Leu Ala Val Lys Leu Tyr Glu Gln Ala

565 570 575 

Ala Ala Ala Gln Gln Ala Gln Glu Gly Ala Glu Gly Ala Gln Ala Thr
580 585 590

Gly Asn Ala Gly Asp Asp Val Val Asp Gly Glu Phe Thr Glu Lys 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 352 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Asn Asn Thr Glu Phe Tyr Asp Arg Leu Gly Val Ser Lys Asn Ala
1 5 10 15

Ser Ala Asp Glu Ile Lys Lys Ala Tyr Arg Lys Leu Ser Lys Lys Tyr

20 25 30 



His Pro Asp Ile Asn Lys Glu Pro Gly Ala Glu Asp Lys Tyr Lys Glu
35 40 45

Val Gln Glu Ala Tyr Glu Thr Leu Ser Asp Asp Gln Lys Arg Ala Ala
50 55 60

Tyr Asp Gln Tyr Gly Ala Ala Gly Ala Asn Gly Gly Phe Gly Gly Ala
65 70 75 80

Gly Gly Phe Gly Gly Phe Asn Gly Ala Gly Gly Phe Gly Gly Phe Glu
85 90 95

Asp Ile Phe Ser Ser Phe Phe Gly Gly Gly Gly Ser Ser Arg Asn Pro
100 105 110

Asn Ala Pro Arg Gln Gly Asp Asp Leu Gln Tyr Arg Val Asn Leu Thr
115 120 125

Phe Glu Glu Ala Ile Phe Gly Thr Glu Lys Glu Val Lys Tyr His Arg
130 135 140

Glu Ala Gly Cys Arg Thr Cys Asn Gly Ser Gly Ala Lys Pro Gly Thr
145 150 155 160

Ser Pro Val Thr Cys Gly Arg Cys His Gly Ala Gly Val Ile Asn Val
165 170 175

Asp Thr Gln Thr Pro Leu Gly Met Met Arg Arg Gln Val Thr Cys Asp
180 185 190

Val Cys His Gly Arg Gly Lys Glu Ile Lys Tyr Pro Cys Thr Thr Cys
195 200 205 



His Gly Thr Gly His Glu Lys Gln Ala His Ser Val His Val Lys Ile
210 215 220

Pro Ala Gly Val Glu Thr Gly Gln Gln Ile Arg Leu Ala Gly Gln Gly
225 230 235 240 



Glu Ala Gly Phe Asn Gly Gly Pro Tyr Gly Asp Leu Tyr Val Val Val 
245 250 255

Ser Val Glu Ala Ser Asp Lys Phe Glu Arg Glu Gly Thr Thr Ile Phe
260 265 270

Tyr Asn Leu Asn Leu Asn Phe Val Gln Ala Ala Leu Gly Asp Thr Val
275 280 285 



Asp Ile Pro Thr Val His Gly Asp Val Glu Leu Val Ile Pro Glu Gly

290 295 300 

Thr Gln Thr Gly Lys Lys Phe Arg Leu Arg Ser Lys Gly Ala Pro Ser
305 310 315 320

Leu Arg Gly Gly Ala Val Gly Asp Gln Tyr Val Thr Val Asn Val Val
325 330 335



Thr Pro Thr Gly Leu Asn Asp Arg Gln Lys Val Ala Leu Lys Glu Phe
340 345 350 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Thr Ser Thr Gln Ile Ser Leu Pro Phe Ile Thr Ala Gly Glu Ala
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Thr Ala Gly Glu Ala Gly Pro Leu His Leu Glu Met Thr Leu Thr
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Thr Leu Thr Arg Ala Lys Phe Asp Asp Leu Thr Arg Asp 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Val Pro Val 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Lys Val Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Leu Thr Asp Glu Ile Asp Arg Met Met Lys Asp Ala Glu Ala



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Lys Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu 
1 5 10 15

Val Asp Leu Arg Asn Glu Val Asp 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg
1 5 10 15

Asp Ala Ala Gln Ala Ala Leu Asp Asp Leu Lys Lys



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 



Lys Ala Gln Glu Asp Asn Asn Leu Asp Asp Met Lys Ala Lys Leu Glu
1 5 10 15

Ala Leu Asn Glu Lys Ala Gln Gly Leu Ala Val Lys Leu Tyr
20 25 30

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gln Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp
1 5 10 15

Val Val Asp Gly Glu Phe Thr Glu Lys
20 25

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus pyogenes

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 204..2030

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CAGCGATGGT AGTTGTTTAT AACTAAGGTA AATGAGTTTT CGTTTTTGTC CGTAATGACA 60 
GTAAACTAGA TAGCAAGTTA GAAGCTATTT CGCTTGCTGA TTAAACTATA GTGATTGCTT 120 



AGAATTGGAA GTAAAATAAT TCGAGTGCTT ACTAAGATAA ATTGAAATAA AAAGTAATAA 180

AGTATAAAAT AAGAGGTATT AAC ATG TCT AAA ATT ATT GGT ATT GAC TTA 230
Met Ser Lys Ile Ile Gly Ile Asp Leu
1 5

GGT ACA ACA AAC TCA GCA GTA GCA GTT CTT GAA GGG ACT GAA TCA AAA 278
Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser Lys
10 15 20 25

ATC ATT GCT AAC CCA GAA GGC AAT CGT ACA ACT CCT TCA GTA GTA TCA 326
Ile Ile Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val Ser
30 35 40

TTC AAA AAT GGT GAA ATT ATC GTG GGT GAT GCT GCA AAA CGC CAA GCA 374
Phe Lys Asn Gly Glu Ile Ile Val Gly Asp Ala Ala Lys Arg Gln Ala
45 50 55
GTG ACA AAC CCA GAA ACA GTA ATC TCT ATT AAA TCT AAA ATG GGA ACT 422
Val Thr Asn Pro Glu Thr Val Ile Ser Ile Lys Ser Lys Met Gly Thr
60 65 70

TCT GAA AAA GTT TCT GCA AAT GGT AAA GAA TAT ACT CCT CAA GAA ATT 470
Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gln Glu Ile
75 80 85

TCA GCA ATG ATT CTT CAA TAC CTT AAA GGT TAT GCT GAA GAC TAT CTT 518
Ser Ala Met Ile Leu Gln Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr Leu
90 95 100 105

GGA GAA AAA GTA GAA AAA GCA GTT ATT ACT GTT CCA GCT TAT TTC AAC 566
Gly Glu Lys Val Glu Lys Ala Val Ile Thr Val Pro Ala Tyr Phe Asn
110 115 120

GAT GCA CAA CGT CAA GCA ACT AAA GAC GCT GGT AAA ATT GCA GGT CTT 614
Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Lys Ile Ala Gly Leu
125 130 135

GAA GTA GAA CGT ATC GTT AAT GAA CCA ACA GCA GCT GCA CTT GCT TAT 662
Glu Val Glu Arg Ile Val Asn Glu Pro Thr Ala Ala Ala Leu Ala Tyr
140 145 150

GGT ATG GAC AAG ACT GAC AAG GAT GAA AAA ATC TTA GTT TTT GAC CTT 710
Gly Met Asp Lys Thr Asp Lys Asp Glu Lys Ile Leu Val Phe Asp Leu
155 160 165

GGT GGT GGT ACA TTT GAC GTA TCA ATC CTT GAA TTA GGT GAT GGT GTC 758
Gly Gly Gly Thr Phe Asp Val Ser Ile Leu Glu Leu Gly Asp Gly Val
170 175 180 185

TTC GAC GTT CTT GCA ACA GCA GGT GAT AAC AAA CTT GGT GGT GAC GAC 806
Phe Asp Val Leu Ala Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp Asp
190 195 200

TTT GAC CAA AAA ATT ATT GAT TTC TTA GTG GCT GAA TTT AAG AAA GAA 854
Phe Asp Gln Lys Ile Ile Asp Phe Leu Val Ala Glu Phe Lys Lys Glu
205 210 215

AAT GGT ATT GAC TT^ TCA CAA GAT AAG ATG GCA CTT CAA CGC TTG AAA 902
Asn Gly Ile Asp Leu Ser Gln Asp Lys Met Ala Leu Gln Arg Leu Lys
220 225 230

GAT GCT GCT GAA AAA GCT AAA AAA GAT CTT TCA GGT GTG ACA CAA ACA 950
Asp Ala Ala Glu Lys Ala Lys Lys Asp Leu Ser Gly Val Thr Gln Thr
235 240 245

CAA ATT TCA TTA CCG TTC ATC ACT GCT GGT TCT GCT GGT CCT CTT CAC 998
Gln Ile Ser Leu Pro Phe Ile Thr Ala Gly Ser Ala Gly Pro Leu His
250 255 260 265

TTA GAG ATG AGC TTA TCT CGT GCT AAA TTT GAC GAT CTC ACT CGT GAC 1046
Leu Glu Met Ser Leu Ser Arg Ala Lys Phe Asp Asp Leu Thr Arg Asp
270 275 280

CTT GTT GAA CGT ACG AAA ACT CCA GTT CGT CAA GCT CTT TCA GAT GCA 1094
Leu Val Glu Arg Thr Lys Thr Pro Val Arg Gln Ala Leu Ser Asp Ala
285 290 295

GGA TTG TCA TTG TCA GAA ATT GAT GAA GTT ATC CTT GTT GGT GGA TCA 1142
Gly Leu Ser Leu Ser Glu Ile Asp Glu Val Ile Leu Val Gly Gly Ser
300 305 310

ACT CGT ATC CCA GCA GTT GTC GAA GCT GTA AAA GCT GAA ACT GGT AAA 1190
Thr Arg Ile Pro Ala Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys
315 320 325
GAA CCA AAT AAA TCT GTA AAC CCT GAT GAA GTG GTT GCT ATG GGT GCT 1238
Glu Pro Asn Lys Ser Val Asn Pro Asp Glu Val Val Ala Met Gly Ala
330 335 340 345

GCT ATC CAA GGT GGG GTT ATC ACT GGG GAT GTG AAA GAC GTT GTC CTT 1286
Ala Ile Gln Gly Gly Val Ile Thr Gly Asp Val Lys Asp Val Val Leu
350 355 360

CTT GAC GTA ACA CCA TTG TCA CTT GGT ATT GAA ACA ATG GGT GGT GTC 1334
Leu Asp Val Thr Pro Leu Ser Leu Gly Ile Glu Thr Met Gly Gly Val
365 370 375

TTC ACT AAA TTG ATC GAC CGC AAT ACA ACT ATC CCA ACA TCT AAA TCA 1382
Phe Thr Lys Leu Ile Asp Arg Asn Thr Thr Ile Pro Thr Ser Lys Ser
380 385 390

CAA GTC TTC TCA ACA GCA GCA GAC AAC CAA CCA GCC GTT GAT ATC CAT 1430
Gln Val Phe Ser Thr Ala Ala Asp Asn Gln Pro Ala Val Asp Ile His
395 400 405

GTT CTT CAA GGT GAA CGC CCA ATG GCA GCA GAT AAC AAG ACT CTT GGT 1478
Val Leu Gln Gly Glu Arg Pro Met Ala Ala Asp Asn Lys Thr Leu Gly
410 415 420 425

CGC TTC CAA TTG ACT GAT ATC CCA GCT GCA CCT CGT GGA ATC CCA CAA 1526
Arg Phe Gln Leu Thr Asp Ile Pro Ala Ala Pro Arg Gly Ile Pro Gln
430 435 440

ATT GAA GTA ACA TTT GAT ATC GAT AAA AAC GGT ATT GTT TCT GTA AAA 1574
Ile Glu Val Thr Phe Asp Ile Asp Lys Asn Gly Ile Val Ser Val Lys
445 450 455

GCT AAA GAC CTT GGT ACG CAA AAG GAA CAA CAC ATC GTT ATC AAA TCA 1622
Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln His Ile Val Ile Lys Ser
460 465 470

AAC GAC GGA CTT TCT GAA GAA GAA ATT GAT CGC ATG ATG AAA GAC GCT 1670
Asn Asp Gly Leu Ser Glu Glu Glu Ile Asp Arg Met Met Lys Asp Ala
475 480 485

GAA GCT AAT GCC GAA GCC GAT GCG AAA CGT AAA GAA GAA GTT GAC CTT 1718
Glu Ala Asn Ala Glu Ala Asp Ala Lys Arg Lys Glu Glu Val Asp Leu
490 495 500 505

AAA AAC GAA GTT GAC CAA GCT ATC TTT GCT ACT GAA AAA ACA ATC AAA 1766
Lys Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys
510 515 520

GAA ACT GAA GGT AAA GGC TTT GAC ACA GAA CGC GAT GCA GCG CAA TCA 1814
Glu Thr Glu Gly Lys Gly Phe Asp Thr Glu Arg Asp Ala Ala Gln Ser
525 530 535

GCT CTT GAC GAG TTA AAA GCT GCG CAA GAA TCT GGC AAC CTT GAC GAC 1862
Ala Leu Asp Glu Leu Lys Ala Ala Gln Glu Ser Gly Asn Leu Asp Asp
540 545 550

ATG AAA GCT AAA CTT GAA GCA TTA AAT GAA AAA GCG CAA GCT TTG GCT 1910
Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Ala Leu Ala
555 560 565

GTT AAA ATG TAC GAG CAA GCT GCA GCA GCT CAA CAA GCA GCA CAA GGT 1958
Val Lys Met Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Ala Gln Gly
570 575 580 585

GCA GAA GGT GCA CAA GCT AAT GAT TCA GCA AAT AAT GAT GAT GTT GTA 2006
Ala Glu Gly Ala Gln Ala Asn Asp Ser Ala Asn Asn Asp Asp Val Val
590 595 600
GAT GGC GAA TTT ACA GAA AAG TAATGATTTA GTTATCTAGT AACATTAATA 2057
Asp Gly Glu Phe Thr Glu Lys
605

TCCGAATTCA GAGGTTGTAC CAAACCTCTG TTTTTGGCTA AATAAAATGT AAAAATGCTG 2117
ACGTCAAAAT ATTTTAAGAA AGGAATAC7A GTTCGATTAT TCGAACACAG GCTAAAGCGT 2177
GTAAAG 2183
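
The CDS coordinates given in the FEATURE block of SEQ ID NO: 19 can be checked against the declared protein length with simple arithmetic. The sketch below is illustrative only: `cds_summary` is a hypothetical helper, and it assumes that the stated range 204..2030 includes the TAA stop codon printed after the final Lys codon in the listing above.

```python
def cds_summary(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotides, codons) for a 1-based, inclusive CDS range."""
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nucleotides, nucleotides // 3


# LOCATION 204..2030 from the FEATURE block of SEQ ID NO: 19.
nucleotides, codons = cds_summary(204, 2030)

# Assumption: the range includes the stop codon, so the encoded protein
# has one residue fewer than the number of codons.
residues = codons - 1
print(nucleotides, codons, residues)
```

Under that assumption the range spans 1827 nucleotides, or 609 codons, which corresponds to the 608 amino acids declared for SEQ ID NO: 20.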

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 608 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Ala Val
1 5 10 15

Ala Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro Glu Gly
20 25 30

Asn Arg Thr Thr Pro Ser Val Val Ser Phe Lys Asn Gly Glu Ile Ile
35 40 45

Val Gly Asp Ala Ala Lys Arg Gln Ala Val Thr Asn Pro Glu Thr Val
50 55 60

Ile Ser Ile Lys Ser Lys Met Gly Thr Ser Glu Lys Val Ser Ala Asn
65 70 75 80

Gly Lys Glu Tyr Thr Pro Gln Glu Ile Ser Ala Met Ile Leu Gln Tyr
85 90 95

Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly Glu Lys Val Glu Lys Ala
100 105 110

Val Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gln Arg Gln Ala Thr
115 120 125

Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu Val Glu Arg Ile Val Asn
130 135 140

Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly Met Asp Lys Thr Asp Lys
145 150 155 160

Asp Glu Lys Ile Leu Val Phe Asp Leu Gly Gly Gly Thr Phe Asp Val
165 170 175

Ser Ile Leu Glu Leu Gly Asp Gly Val Phe Asp Val Leu Ala Thr Ala
180 185 190

Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe Asp Gln Lys Ile Ile Asp
195 200 205

Phe Leu Val Ala Glu Phe Lys Lys Glu Asn Gly Ile Asp Leu Ser Gln
210 215 220

Asp Lys Met Ala Leu Gln Arg Leu Lys Asp Ala Ala Glu Lys Ala Lys
225 230 235 240

Lys Asp Leu Ser Gly Val Thr Gln Thr Gln Ile Ser Leu Pro Phe Ile
245 250 255
Thr Ala Gly Ser Ala Gly Pro Leu His Leu Glu Met Ser Leu Ser Arg
260 265 270

Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Thr
275 280 285

Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu Ser Leu Ser Glu Ile
290 295 300

Asp Glu Val Ile Leu Val Gly Gly Ser Thr Arg Ile Pro Ala Val Val
305 310 315 320

Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn Lys Ser Val Asn
325 330 335

Pro Asp Glu Val Val Ala Met Gly Ala Ala Ile Gln Gly Gly Val Ile
340 345 350

Thr Gly Asp Val Lys Asp Val Val Leu Leu Asp Val Thr Pro Leu Ser
355 360 365

Leu Gly Ile Glu Thr Met Gly Gly Val Phe Thr Lys Leu Ile Asp Arg
370 375 380

Asn Thr Thr Ile Pro Thr Ser Lys Ser Gln Val Phe Ser Thr Ala Ala
385 390 395 400

Asp Asn Gln Pro Ala Val Asp Ile His Val Leu Gln Gly Glu Arg Pro
405 410 415

Met Ala Ala Asp Asn Lys Thr Leu Gly Arg Phe Gln Leu Thr Asp Ile
420 425 430

Pro Ala Ala Pro Arg Gly Ile Pro Gln Ile Glu Val Thr Phe Asp Ile
435 440 445

Asp Lys Asn Gly Ile Val Ser Val Lys Ala Lys Asp Leu Gly Thr Gln
450 455 460

Lys Glu Gln His Ile Val Ile Lys Ser Asn Asp Gly Leu Ser Glu Glu
465 470 475 480

Glu Ile Asp Arg Met Met Lys Asp Ala Glu Ala Asn Ala Glu Ala Asp
485 490 495

Ala Lys Arg Lys Glu Glu Val Asp Leu Lys Asn Glu Val Asp Gln Ala
500 505 510

Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe
515 520 525

Asp Thr Glu Arg Asp Ala Ala Gln Ser Ala Leu Asp Glu Leu Lys Ala
530 535 540

Ala Gln Glu Ser Gly Asn Leu Asp Asp Met Lys Ala Lys Leu Glu Ala
545 550 555 560

Leu Asn Glu Lys Ala Gln Ala Leu Ala Val Lys Met Tyr Glu Gln Ala
565 570 575

Ala Ala Ala Gln Gln Ala Ala Gln Gly Ala Glu Gly Ala Gln Ala Asn
580 585 590

Asp Ser Ala Asn Asn Asp Asp Val Val Asp Gly Glu Phe Thr Glu Lys
595 600 605




(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2438 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus agalactiae

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 248..2077



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTTTCAAAAG GGATATAAAT TGCACGAGCG TCTGCTAAGA CCAGCGATGG TAGTTGTCTA 60 

TAACTAAGGT AAATGAGTTT TCGTTTTTGT CCGTAATGAC AGTAAACTAG ATAGCAAGTT 120 

AGAAGCTATT CAGCTTGCTG ATTAAACTAT AGTGATTGCT TAGAATTGGA AGTAAAATAA 180

TTCGAGTGCT TACTAAGATA AATTGAAATA AAAAGTAATA AAGTATTATA AAATAAGAGG 240

TATTAAC ATG TCT AAA ATT ATT GGT ATT GAC TTA GGT ACA ACA AAC TCA 289
Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser
1 5 10

GCA GTA GCA GTT CTT GAA GGG ACT GAA TCA AAA ATC ATT GCT AAC CCA 337
Ala Val Ala Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro
15 20 25 30

GAA GGC AAT CGT ACA ACT CCT TCA GTA GTA TCA TTC AAA AAT GGT GAA 385
Glu Gly Asn Arg Thr Thr Pro Ser Val Val Ser Phe Lys Asn Gly Glu
35 40 45

ATT ATC GTG GGT GAT GCT GCA AAA CGT CAA GCG GTA ACA AAT CCA GAT 433
Ile Ile Val Gly Asp Ala Ala Lys Arg Gln Ala Val Thr Asn Pro Asp
50 55 60

ACT GTT ATC TCT ATC AAA TCA AAG ATG GGA ACT TCT GAA AAA GTT TCT 481
Thr Val Ile Ser Ile Lys Ser Lys Met Gly Thr Ser Glu Lys Val Ser
65 70 75

GCA AAT GGT AAA GAA TAT ACT CCT CAA GAA ATT TCA GCA ATG ATT CTT 529
Ala Asn Gly Lys Glu Tyr Thr Pro Gln Glu Ile Ser Ala Met Ile Leu
80 85 90

CAA TAC CTT AAA GGT TAT GCT GAA GAC TAT CTT GGA GAA AAA GTA GAA 577
Gln Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly Glu Lys Val Glu
95 100 105 110

AAA GCA GTT ATT ACT GTT CCA GCT TAC TTC AAC GAT GCA CAA CGT CAG 625
Lys Ala Val Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gln Arg Gln
115 120 125

GCA ACT AAA GAC GCT GGT AAA ATT GCA GGT CTT GAA GTA GAA CGT ATC 673
Ala Thr Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu Val Glu Arg Ile
130 135 140
GTT AAC GAA CCA ACA GCA GCC GCA CTT GCT TAT GGT ATG GAC AAG ACT 721
Val Asn Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly Met Asp Lys Thr
145 150 155

GAC AAG GAT GAA AAA ATC TTA GTT TTT GAC CTT GGT GGT GGT ACA TTT 769
Asp Lys Asp Glu Lys Ile Leu Val Phe Asp Leu Gly Gly Gly Thr Phe
160 165 170

GAC GTA TCA ATC CTT GAA TTA GGT GAT GGT GTC TTC GAC GTT CTT GCA 817
Asp Val Ser Ile Leu Glu Leu Gly Asp Gly Val Phe Asp Val Leu Ala
175 180 185 190

ACA GCA GGT GAT AAC AAA CTT GGT GGT GAC GAC TTT GAC CAG AAA ATT 865
Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe Asp Gln Lys Ile
195 200 205

ATT GAT TTC TTG GTA GAA GAA TTC AAG AAA GAA AAT GGT ATT GAT CTT 913
Ile Asp Phe Leu Val Glu Glu Phe Lys Lys Glu Asn Gly Ile Asp Leu
210 215 220

TCT CAA GAC AAA ATG GCT CTT CAA CGC TTG AAA GAT GCT GCT GAA AAA 961
Ser Gln Asp Lys Met Ala Leu Gln Arg Leu Lys Asp Ala Ala Glu Lys
225 230 235

GCT AAA AAA GAC CTT TCA GGT GTA ACT CAA ACT CAA ATT TCA TTA CCG 1009
Ala Lys Lys Asp Leu Ser Gly Val Thr Gln Thr Gln Ile Ser Leu Pro
240 245 250

TTC ATC ACT GCT GGT TCT GCT GGT CCT CTT CAC TTG GAG ATG AGC TTA 1057
Phe Ile Thr Ala Gly Ser Ala Gly Pro Leu His Leu Glu Met Ser Leu
255 260 265 270

TCA CGT GCT AAA TTT GAC GAT CTC ACT CGT GAC CTT GTT GAA CGT ACG 1105
Ser Arg Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr
275 280 285

AAA ACT CCA GTT CGT CAA GCT CTT TCA GAT GCA GGC TTG TCA TTG TCA 1153
Lys Thr Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu Ser Leu Ser
290 295 300

GAA ATT GAT GAA GTT ATC CTC GTT GGT GGA TCA ACA CGT ATC CCA GCA 1201
Glu Ile Asp Glu Val Ile Leu Val Gly Gly Ser Thr Arg Ile Pro Ala
305 310 315

GTT GTT GAA GCT GTA AAA GCT GAA ACT GGT AAA GAA CCA AAT AAA TCT 1249
Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn Lys Ser
320 325 330

GTT AAC CCT GAT GAA GTG GTT GCC ATG GGT GCT GCT ATC CAA GGT GGT 1297
Val Asn Pro Asp Glu Val Val Ala Met Gly Ala Ala Ile Gln Gly Gly
335 340 345 350

GTT ATC ACT GGG GAT GTG AAA GAC GTT GTA CTT CTT GAC GTA ACA CCA 1345
Val Ile Thr Gly Asp Val Lys Asp Val Val Leu Leu Asp Val Thr Pro
355 360 365

TTG TCA CTT GGT ATT GAA ACA ATG GGT GGT GTC TTC ACT AAA TTG ATC 1393
Leu Ser Leu Gly Ile Glu Thr Met Gly Gly Val Phe Thr Lys Leu Ile
370 375 380

GAC CGC AAC ACA ACT ATC CCA ACA TCT AAA TCA CAA GTC TTC TCA ACA 1441
Asp Arg Asn Thr Thr Ile Pro Thr Ser Lys Ser Gln Val Phe Ser Thr
385 390 395

GCA GCA GAC AAC CAA CCA GCC GTT GAT ATC CAT GTT CTT CAA GGT GAA 1489
Ala Ala Asp Asn Gln Pro Ala Val Asp Ile His Val Leu Gln Gly Glu
400 405 410
CGC CCA ATG GCA GCA GAT AAC AAA ACA CTC GGT CGC TTC CAA TTG ACT 1537
Arg Pro Met Ala Ala Asp Asn Lys Thr Leu Gly Arg Phe Gln Leu Thr
415 420 425 430

GAT ATC CCA GCT GCA CCT CGT GGA ATC CCA CAA ATT GAA GTA ACA TTT 1585
Asp Ile Pro Ala Ala Pro Arg Gly Ile Pro Gln Ile Glu Val Thr Phe
435 440 445

GAT ATC GAT AAA AAT GGT ATT GTA TCT GTT AAA GCT AAA GAT CTC GGT 1633
Asp Ile Asp Lys Asn Gly Ile Val Ser Val Lys Ala Lys Asp Leu Gly
450 455 460

ACT CAA AAA GAA CAA CAC ATT GTT ATC CAA TCT AAT TCA GGA TTA ACT 1681
Thr Gln Lys Glu Gln His Ile Val Ile Gln Ser Asn Ser Gly Leu Thr
465 470 475

GAT GAA GAA ATT GAT AAA ATG ATG AAA GAT GCT GAA GCA AAT GCT GAA 1729
Asp Glu Glu Ile Asp Lys Met Met Lys Asp Ala Glu Ala Asn Ala Glu
480 485 490

GCA GAT GCA AAA CGT AAA GAA GAA GTT GAT CTT AAA AAT GAA GTT GAC 1777
Ala Asp Ala Lys Arg Lys Glu Glu Val Asp Leu Lys Asn Glu Val Asp
495 500 505 510

CAA GCC ATC TTT GCA ACA GAA AAA ACT ATT AAA GAA ACT GAA GGC AAA 1825
Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys
515 520 525

GGT TTT GAT ACA GAA CGC GAT GCA GCG CAA TCA GCA CTT GAT GAG TTG 1873
Gly Phe Asp Thr Glu Arg Asp Ala Ala Gln Ser Ala Leu Asp Glu Leu
530 535 540

AAA AAA GCT CAA GAA TCA GGT AAC CTT GAC GAC ATG AAA GCT AAA CTT 1921
Lys Lys Ala Gln Glu Ser Gly Asn Leu Asp Asp Met Lys Ala Lys Leu
545 550 555

GAA GCT CTT AAC GAA AAA GCA CAA GCT CTT GCA GTT AAA CTT TAC GAA 1969
Glu Ala Leu Asn Glu Lys Ala Gln Ala Leu Ala Val Lys Leu Tyr Glu
560 565 570

CAA GCG GCT GCA GCA CAA CAA GCA GCT CAA GGG GCT GAA GGT GCA CAA 2017
Gln Ala Ala Ala Ala Gln Gln Ala Ala Gln Gly Ala Glu Gly Ala Gln
575 580 585 590

TCA GCT GAT TCA TCA AGC AAG GGT GAT GAT GTT GTA GAT GGC GAA TTC 2065
Ser Ala Asp Ser Ser Ser Lys Gly Asp Asp Val Val Asp Gly Glu Phe
595 600 605

ACT GAG AAA TAATTATTAA TATTGTTCAG ATTCATTTGA ATATAAGCAT 2114
Thr Glu Lys
610

GAAAACTATA CTAGCATAGT AAAGTTCTTC GTGATAGGGA TTGCTCAATA ATCTAGATAA 2174

GTTTCAGATT ACATAAGCTA ATTTCGCTAT CACTAAATAA AAACATATTA ATAATAAATA 2234

GGCGGGGCGC CTCGCTCCGT CTGTTTTATT AAGTGTCATA TATATGTTAA CTATTTAGAG 2294

CTGTAACTGG GCAAGAATAA TTGTTAATCT CTTCAAGTGT AGTATATGAA CAAAATATAA 2354

AGGATTAGAT AATGAACAAT ACAGAATTTT ATGATCGTCT TGGCGTTTCA AAAGATGCTT 2414

CTCAGGACGA AATAAAAAAA GCTT 2438




(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 609 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Ala Val
1 5 10 15

Ala Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro Glu Gly
20 25 30

Asn Arg Thr Thr Pro Ser Val Val Ser Phe Lys Asn Gly Glu Ile Ile
35 40 45

Val Gly Asp Ala Ala Lys Arg Gln Ala Val Thr Asn Pro Asp Thr Val
50 55 60

Ile Ser Ile Lys Ser Lys Met Gly Thr Ser Glu Lys Val Ser Ala Asn
65 70 75 80

Gly Lys Glu Tyr Thr Pro Gln Glu Ile Ser Ala Met Ile Leu Gln Tyr
85 90 95

Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly Glu Lys Val Glu Lys Ala
100 105 110

Val Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gln Arg Gln Ala Thr
115 120 125

Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu Val Glu Arg Ile Val Asn
130 135 140

Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly Met Asp Lys Thr Asp Lys
145 150 155 160

Asp Glu Lys Ile Leu Val Phe Asp Leu Gly Gly Gly Thr Phe Asp Val
165 170 175

Ser Ile Leu Glu Leu Gly Asp Gly Val Phe Asp Val Leu Ala Thr Ala
180 185 190

Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe Asp Gln Lys Ile Ile Asp
195 200 205

Phe Leu Val Glu Glu Phe Lys Lys Glu Asn Gly Ile Asp Leu Ser Gln
210 215 220

Asp Lys Met Ala Leu Gln Arg Leu Lys Asp Ala Ala Glu Lys Ala Lys
225 230 235 240

Lys Asp Leu Ser Gly Val Thr Gln Thr Gln Ile Ser Leu Pro Phe Ile
245 250 255

Thr Ala Gly Ser Ala Gly Pro Leu His Leu Glu Met Ser Leu Ser Arg
260 265 270

Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Thr
275 280 285

Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu Ser Leu Ser Glu Ile
290 295 300

Asp Glu Val Ile Leu Val Gly Gly Ser Thr Arg Ile Pro Ala Val Val
305 310 315 320

Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn Lys Ser Val Asn
325 330 335

Pro Asp Glu Val Val Ala Met Gly Ala Ala Ile Gln Gly Gly Val Ile
340 345 350

Thr Gly Asp Val Lys Asp Val Val Leu Leu Asp Val Thr Pro Leu Ser
355 360 365

Leu Gly Ile Glu Thr Met Gly Gly Val Phe Thr Lys Leu Ile Asp Arg
370 375 380

Asn Thr Thr Ile Pro Thr Ser Lys Ser Gln Val Phe Ser Thr Ala Ala
385 390 395 400

Asp Asn Gln Pro Ala Val Asp Ile His Val Leu Gln Gly Glu Arg Pro
405 410 415

Met Ala Ala Asp Asn Lys Thr Leu Gly Arg Phe Gln Leu Thr Asp Ile
420 425 430

Pro Ala Ala Pro Arg Gly Ile Pro Gln Ile Glu Val Thr Phe Asp Ile
435 440 445

Asp Lys Asn Gly Ile Val Ser Val Lys Ala Lys Asp Leu Gly Thr Gln
450 455 460

Lys Glu Gln His Ile Val Ile Gln Ser Asn Ser Gly Leu Thr Asp Glu
465 470 475 480

Glu Ile Asp Lys Met Met Lys Asp Ala Glu Ala Asn Ala Glu Ala Asp
485 490 495

Ala Lys Arg Lys Glu Glu Val Asp Leu Lys Asn Glu Val Asp Gln Ala
500 505 510

Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe
515 520 525

Asp Thr Glu Arg Asp Ala Ala Gln Ser Ala Leu Asp Glu Leu Lys Lys
530 535 540

Ala Gln Glu Ser Gly Asn Leu Asp Asp Met Lys Ala Lys Leu Glu Ala
545 550 555 560

Leu Asn Glu Lys Ala Gln Ala Leu Ala Val Lys Leu Tyr Glu Gln Ala
565 570 575

Ala Ala Ala Gln Gln Ala Ala Gln Gly Ala Glu Gly Ala Gln Ser Ala
580 585 590

Asp Ser Ser Ser Lys Gly Asp Asp Val Val Asp Gly Glu Phe Thr Glu
595 600 605

Lys



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Arg Ile Pro Ala Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys Glu
1 5 10 15

Pro Asn Lys
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Gln Thr Ile Val Ile Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Streptococcus pneumoniae

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..456

(D) OTHER INFORMATION: /product = "C-terminal 151-residue
fragment (C-151) of HSP72"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:



ATG AAG GCC AAA GAC CTT GGA ACT CAA AAA GAA CAA ACT ATT GTC ATC 48
Met Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
1 5 10 15

CAA TCG AAC TCA GGT TTG ACT GAC GAA GAA ATC GAC CGC ATG ATG AAA 96
Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys
20 25 30

GAT GCA GAA GCA AAC GCT GAA TCC GAT AAG AAA CGT AAA GAA GAA GTA 144
Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val
35 40 45

GAC CTT CGT AAT GAA GTG GAC CAA GCA ATC TTT GCG ACT GAA AAG ACA 192
Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr
50 55 60

ATC AAG GAA ACT GAA GGT AAA GGC TTC GAC GCA GAA CGT GAC GCT GCC 240
Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala
65 70 75 80

CAA GCT GCC CTT GAT GAC CTT AAG AAA GCT CAA GAA GAC AAC AAC TTG 288
Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu
85 90 95

GAC GAC ATG AAA GCA AAA CTT GAA GCA TTG AAC GAA AAA GCT CAA GGA 336
Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly
100 105 110
CTT GCT GTT AAA CTC TAC GAA CAA GCC GCA GCA GCG CAA CAA GCT CAA 384
Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln
115 120 125

GAA GGA GCA GAA GGC GCA CAA GCA ACA GGA AAC GCA GGC GAT GAC GTC 432
Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val
130 135 140

GTA GAC GGA GAG TTT ACG GAA AAG TAAG 460
Val Asp Gly Glu Phe Thr Glu Lys
145 150



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 152 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
1 5 10 15

Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys
20 25 30

Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val
35 40 45

Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr
50 55 60

Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala
65 70 75 80

Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu
85 90 95

Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly
100 105 110

Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln
115 120 125

Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val
130 135 140

Val Asp Gly Glu Phe Thr Glu Lys
145 150
We claim:



1. A polypeptide selected from the group
consisting of:

(a) the HSP72 polypeptide having the amino
acid sequence of SEQ ID NO: 5;

(b) the HSP70 (DnaK) polypeptide having the
amino acid sequence of SEQ ID NO: 20;

(c) the HSP70 (DnaK) polypeptide having
the amino acid sequence of SEQ ID NO: 22;

(d) polypeptides that are immunologically
reactive with antibodies generated by infection of a
mammalian host with Streptococcus pneumoniae cells, which
antibodies are immunologically reactive with the
polypeptide of paragraph (a), (b), or (c);

(e) polypeptides that are capable of
eliciting antibodies that are immunologically reactive
with the polypeptide of paragraph (a), (b), or (c);

(f) polypeptides that are immunologically
reactive with antibodies elicited by immunization with the
polypeptide of paragraph (a), (b), or (c); and

(g) fragments of any of the foregoing
polypeptides, either alone or in combination with other
polypeptides to form a fusion protein.

2. The polypeptide of claim 1, wherein the
polypeptides of paragraph (d) are selected from the group
consisting of polypeptides of the genera Streptococcus and
Enterococcus.

3. The polypeptide of claim 1, wherein the
polypeptides of paragraph (d) are selected from the group
consisting of polypeptides of the species Streptococcus
pneumoniae, Streptococcus agalactiae, Streptococcus
pyogenes, Streptococcus mutans, Streptococcus sanguis, and
Enterococcus faecalis.

4. The polypeptide of claim 1, wherein the
polypeptides of paragraph (d) are selected from the group
consisting of polypeptides of the species Streptococcus
pneumoniae, Streptococcus agalactiae, and Streptococcus
pyogenes.

5. The polypeptide of claim 1, wherein the
fragments of paragraph (g) are selected from the group
consisting of amino acids 439-607 of SEQ ID NO: 5 (C-169),
amino acids 457-607 of SEQ ID NO: 5 (C-151), amino acids
527-541 of SEQ ID NO: 5, and amino acids 586-600 of SEQ ID
NO: 5.
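
The fragment names in claim 5 follow from the residue ranges: a range written first-last is inclusive at both ends, so amino acids 439-607 span 169 residues (C-169) and 457-607 span 151 residues (C-151). A minimal sketch of that arithmetic, and of slicing such a fragment from a one-letter sequence string, is given below. The helper names are hypothetical, and the short demo string is a placeholder, not SEQ ID NO: 5.

```python
def fragment_length(first: int, last: int) -> int:
    """Number of residues in a 1-based range that includes both ends."""
    return last - first + 1


def fragment(sequence: str, first: int, last: int) -> str:
    """Slice residues first..last (1-based, inclusive) from a sequence."""
    return sequence[first - 1:last]


# Residue ranges recited in claim 5, all relative to SEQ ID NO: 5.
claim_5_ranges = {
    "C-169": (439, 607),
    "C-151": (457, 607),
    "527-541": (527, 541),
    "586-600": (586, 600),
}
lengths = {name: fragment_length(*span) for name, span in claim_5_ranges.items()}
print(lengths)

# Placeholder sequence used only to show the slicing convention.
demo = fragment("ACDEFGHIKL", 3, 5)
print(demo)
```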

6. A polypeptide having the amino acid
sequence of SEQ ID NO: 5, analogues, homologues and
derivatives thereof.

7. A polypeptide having the amino acid
sequence of SEQ ID NO: 20, analogues, homologues and
derivatives thereof.

8. A polypeptide having the amino acid
sequence of SEQ ID NO: 22, analogues, homologues and
derivatives thereof.

9. A polypeptide having the amino acid
sequence of SEQ ID NO: 26, analogues, homologues and
derivatives thereof.
10. A polypeptide having the amino acid
sequence of SEQ ID NO: 7, analogues, homologues and
derivatives thereof.

11. A polypeptide having the amino acid
sequence of SEQ ID NO: 8, analogues, homologues and
derivatives thereof.

12. A polypeptide having the amino acid
sequence of SEQ ID NO: 9, analogues, homologues and
derivatives thereof.

13. A polypeptide having the amino acid
sequence of SEQ ID NO: 10, analogues, homologues and
derivatives thereof.

14. A polypeptide having the amino acid
sequence of SEQ ID NO: 11, analogues, homologues and
derivatives thereof.

15. A polypeptide having the amino acid
sequence of SEQ ID NO: 12, analogues, homologues and
derivatives thereof.

16. A polypeptide having the amino acid
sequence of SEQ ID NO: 13, analogues, homologues and
derivatives thereof.

17. A polypeptide having the amino acid
sequence of SEQ ID NO: 14, analogues, homologues and
derivatives thereof.

18. A polypeptide having the amino acid
sequence of SEQ ID NO: 15, analogues, homologues and
derivatives thereof.

19. A polypeptide having the amino acid
sequence of SEQ ID NO: 16, analogues, homologues and
derivatives thereof.

20. A polypeptide having the amino acid
sequence of SEQ ID NO: 17, analogues, homologues and
derivatives thereof.

21. A polypeptide having the amino acid
sequence of SEQ ID NO: 18, analogues, homologues and
derivatives thereof.

22. The polypeptide of any one of claims 1 to 21 
and 100-101, wherein said polypeptide elicits an immune 
reaction that is specific to Streptococcal strains. 

23. A polypeptide selected from the group
consisting of:

(a) the HSP72 polypeptide having the amino
acid sequence of SEQ ID NO: 5;

(b) polypeptides that are immunologically
reactive with antibodies generated by infection of a
mammalian host with Streptococcus pneumoniae cells, which
antibodies are immunologically reactive with the HSP72
polypeptide of paragraph (a);

(c) polypeptides that are capable of
eliciting antibodies that are immunologically reactive
with the HSP72 polypeptide of paragraph (a);

(d) polypeptides that are immunologically
reactive with antibodies elicited by immunization with the
HSP72 polypeptide of paragraph (a); and

(e) fragments of any of the foregoing
polypeptides, either alone or in combination with other
polypeptides to form a fusion protein.

24. The polypeptide of claim 23, wherein the
polypeptides of paragraph (b) are selected from the group
consisting of polypeptides of the genera Streptococcus and
Enterococcus.

25. The polypeptide of claim 23, wherein the
polypeptides of paragraph (b) are selected from the group
consisting of polypeptides of the species Streptococcus
pyogenes, Streptococcus mutans, Streptococcus sanguis, and
Enterococcus faecalis.

26. The polypeptide of claim 23, wherein the
fragments of paragraph (e) are selected from the group
consisting of amino acids 439-607 of SEQ ID NO: 5 (C-169),
amino acids 527-541 of SEQ ID NO: 5, and amino acids
586-600 of SEQ ID NO: 5.

27. The polypeptide of claim 23, wherein the
fusion protein of paragraph (e) is the Fucose
Isomerase-HSP72 (C-169) protein having the amino acid
sequence of SEQ ID NO: 3.



30 



28. A DNA sequence selected from the group
consisting of:

(a) the HSP72 DNA sequence of SEQ ID NO: 4;

(b) the HSP70 (DnaK) DNA sequence of SEQ
ID NO: 19;

(c) the HSP70 (DnaK) DNA sequence of SEQ
ID NO: 21;

(d) DNA sequences encoding polypeptides
that are immunologically reactive with antibodies
generated by infection of a mammalian host with
Streptococcus pneumoniae cells, which antibodies are
immunologically reactive with the HSP72 polypeptide (SEQ
ID NO: 5);

(e) DNA sequences encoding polypeptides
that are capable of eliciting antibodies that are
immunologically reactive with the HSP72 polypeptide (SEQ
ID NO: 5);

(f) DNA sequences encoding polypeptides
that are immunologically reactive with antibodies elicited
by immunization with the HSP72 polypeptide (SEQ ID NO: 5);

(g) DNA sequences that are degenerate to
any of the foregoing DNA sequences; and

(h) fragments of any of the foregoing DNA
sequences, either alone or in combination with other DNA
sequences to form a fusion DNA sequence.
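
By way of illustration only, the notion of DNA sequences that are "degenerate" to one another can be sketched as follows. The sketch assumes the standard genetic code; the two 18-nucleotide variants are hypothetical resolutions of the first positions of the consensus row of the aligned open reading frames (where Y stands for C or T) and are not sequences recited in the claims. The helper name translate is likewise an assumption.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical variants differing only at a third codon position (ATC vs ATT).
variant_a = "ATGTCTAAAATTATCGGT"
variant_b = "ATGTCTAAAATTATTGGT"

# Both variants encode the same peptide, so they are degenerate to each other.
print(translate(variant_a), translate(variant_b))
```

Two sequences that differ only at such silent positions encode the same polypeptide, which is the sense in which paragraph (g) extends the claim beyond the exact sequences listed.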

29. The DNA sequence of claim 28, wherein the
DNA sequences of paragraph (d) are selected from the group
consisting of DNA sequences of the genera Streptococcus
and Enterococcus.

30. The DNA sequence of claim 28, wherein the
DNA sequences of paragraph (d) are selected from the group
consisting of DNA sequences of the species Streptococcus
pneumoniae, Streptococcus agalactiae, Streptococcus
pyogenes, Streptococcus mutans, Streptococcus sanguis, and
Enterococcus faecalis.

31. The DNA sequence of claim 28, wherein the
DNA sequences of paragraph (d) are selected from the group
consisting of DNA sequences of the species Streptococcus
pneumoniae, Streptococcus agalactiae, and Streptococcus
pyogenes.

32. A DNA sequence of the formula of SEQ ID
NO: 4 from nucleotide 682 to nucleotide 2502, or
derivatives thereof, coding for HSP72.

33. A DNA sequence of the formula of SEQ ID
NO: 4 from nucleotide 1996 to nucleotide 2502, or
derivatives thereof, coding for the C-169 fragment of
HSP72.

34. A DNA sequence of the formula of SEQ ID
NO: 4 from nucleotide 2050 to nucleotide 2502, or
derivatives thereof, coding for the C-151 fragment of
HSP72.

35. A DNA sequence of the formula of SEQ ID
NO: 4 from nucleotide 2260 to nucleotide 2304, or
derivatives thereof.

36. A DNA sequence of the formula of SEQ ID
NO: 4 from nucleotide 2437 to nucleotide 2481, or
derivatives thereof.

37. A DNA sequence of the formula of SEQ ID
NO: 19 from nucleotide 204 to nucleotide 2027, or
derivatives thereof, coding for HSP70 of Streptococcus
agalactiae.

38. A DNA sequence of the formula of SEQ ID
NO: 21 from nucleotide 248 to nucleotide 2074, or
derivatives thereof, coding for HSP70 of Streptococcus
pyogenes.

39. A DNA sequence of the formula of SEQ ID
NO: 25 from nucleotide 4 to nucleotide 456, or derivatives
thereof, coding for the C-terminal 151-residue fragment
(C-151) of HSP72.

40. A DNA sequence coding for a polypeptide
according to any one of claims 1-21 and 100-101.

41. A DNA sequence selected from the group
consisting of:

(a) the HSP72 DNA sequence of SEQ ID NO: 4;

(b) DNA sequences encoding polypeptides
that are immunologically reactive with antibodies
generated by infection of a mammalian host with
Streptococcus pneumoniae cells, which antibodies are
immunologically reactive with the HSP72 polypeptide (SEQ
ID NO: 5);

(c) DNA sequences encoding polypeptides
that are capable of eliciting antibodies that are
immunologically reactive with the HSP72 polypeptide (SEQ
ID NO: 5);

(d) DNA sequences encoding polypeptides
that are immunologically reactive with antibodies elicited
by immunization with the HSP72 polypeptide (SEQ ID NO: 5);

(e) DNA sequences that are degenerate to
any of the foregoing DNA sequences; and

(f) fragments of any of the foregoing DNA
sequences, either alone or in combination with other DNA
sequences to form a fusion DNA sequence.

42. The DNA sequence of claim 41, wherein the
DNA sequences of paragraph (b) are selected from the group
consisting of DNA sequences of the genera Streptococcus
and Enterococcus.

43. The DNA sequence of claim 41, wherein the
DNA sequences of paragraph (b) are selected from the group
consisting of DNA sequences of the species Streptococcus
pyogenes, Streptococcus mutans, Streptococcus sanguis, and
Enterococcus faecalis.



44. The DNA sequence of claim 41, wherein the
fragments of paragraph (f) are selected from the group
consisting of nucleotides 1996-2502 (amino acids 439-607)
of SEQ ID NO: 4 (C-169); nucleotides 2260-2304 (amino acids
527-541) of SEQ ID NO: 4; and nucleotides 2437-2481 (amino
acids 586-600) of SEQ ID NO: 4.
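
The correspondence between the nucleotide ranges and the amino acid ranges recited above can be checked with a short calculation. The following is a minimal sketch, assuming that the HSP72 coding region begins at nucleotide 682 of SEQ ID NO: 4 (the first nucleotide of the range 682-2502 recited for HSP72) and that positions are 1-based and inclusive; the function name is hypothetical.

```python
# Assumption: first nucleotide of the HSP72 coding region in SEQ ID NO: 4.
ORF_START = 682


def nt_range_to_aa_range(nt_first, nt_last, orf_start=ORF_START):
    """Map an inclusive, 1-based nucleotide range onto amino acid positions."""
    offset = nt_first - orf_start
    length = nt_last - nt_first + 1
    if offset < 0 or offset % 3 != 0 or length % 3 != 0:
        raise ValueError("range does not cover whole codons in this reading frame")
    aa_first = offset // 3 + 1
    aa_last = aa_first + length // 3 - 1
    return aa_first, aa_last


# Nucleotide ranges as recited in the claims.
for label, (first, last) in {
    "C-169": (1996, 2502),
    "2260-2304": (2260, 2304),
    "2437-2481": (2437, 2481),
}.items():
    print(label, nt_range_to_aa_range(first, last))
```

Under these assumptions each recited nucleotide range covers a whole number of codons, and the computed positions agree with the amino acid ranges given in parentheses in the claim.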



45. The DNA sequence of claim 41, wherein the
fusion DNA sequence of paragraph (f) is the Fucose
Isomerase-HSP72 (C-169) DNA sequence of SEQ ID NO: 1
(nucleotides 771-2912).



46. An expression vector including at least one
DNA sequence according to claim 41 operably linked to a
promoter.

47. A recombinant DNA molecule comprising a DNA
sequence according to any one of claims 28 to 40, and one
or more expression control sequences operably linked to the
DNA sequence.

48. The recombinant DNA molecule of claim 47,
wherein said expression control sequence is an inducible
expression vector.

49. The recombinant molecule of claim 48,
wherein said expression vector comprises the λ PL
promoter.

50. A recombinant molecule according to claim
47 consisting of a plasmid selected from the group
consisting of: pURV3, pURV4, pURV5, pURV6, pJBD291,
pJBDA4, pJBDkSl, pJBD177, pJBD171, pJBDl77, pJBD179,
pJBDAl, pJBDfSl, and pJBDf62.

51. A unicellular host transformed with an
expression vector of claim 46.

52. A unicellular host transformed with a
recombinant DNA molecule of claim 47.

53. A unicellular host according to claim 52,
wherein said host is selected from the group consisting
of: E. coli strains XL1 Blue MRF', W3110, JM109, Y1090 and
BL21(DE3).

54. A method for producing a polypeptide or
fragment thereof comprising the steps of culturing the
unicellular host of claim 51 and isolating said
polypeptide or fragment.

55. An antibody or fragment thereof that
specifically binds to a polypeptide of claim 23.

56. An antibody or fragment thereof that
specifically binds to the epitope recognized by monoclonal
antibody F1-Pn3.1.

57. The antibody or fragment of claim 55, which
is a monoclonal antibody.

58. The monoclonal antibody or fragment of
claim 57, which is of murine origin.

59. The monoclonal antibody or fragment of
claim 58, which is of IgG type.

60. The monoclonal antibody of claim 59, which
is selected from the group consisting of F1-Pn3.1,
F2-Pn3.2, F2-Pn3.3, and F2-Pn3.4.

61. The monoclonal antibody F1-Pn3.1.

62. A method for isolating the antibody of
claim 55 comprising:

(a) introducing a preparation of the
polypeptide of claim 23 into a mammal; and

(b) isolating serum from the mammal
containing said antibody.

63. A method for isolating the monoclonal
antibody of claim 57 comprising:

(a) introducing a preparation of the
polypeptide of claim 23 to antibody producing cells of a
mammal;

(b) fusing the antibody producing cells
with myeloma cells to form hybridoma cells; and

(c) isolating said monoclonal antibody
from the hybridoma cells.

64. A pharmaceutical composition comprising a
polypeptide of claim 23.

65. The pharmaceutical composition of claim 64,
which is a vaccine.

66. The pharmaceutical composition of claim 64,
further comprising one or more pharmaceutically acceptable
excipients.

67. A method for preventing infection of a
patient by Streptococcus pneumoniae or related bacteria
comprising the administration of a pharmaceutically
effective amount of the vaccine of claim 65.

68. A pharmaceutical composition comprising one
or more antibodies or fragments thereof according to
claim 55.

69. The pharmaceutical composition of claim 68,
which is a vaccine.

70. The pharmaceutical composition of claim 69,
further comprising a pharmaceutically acceptable
excipient.

71. The pharmaceutical composition of claim 69,
wherein the antibody is selected from the group consisting
of F1-Pn3.1, F2-Pn3.2, F2-Pn3.3, and F2-Pn3.4.

72. The pharmaceutical composition of claim 69,
wherein the antibody is F1-Pn3.1.

73. A method for treating a patient infected
with or suspected of being infected with Streptococcus
pneumoniae or related bacteria comprising the
administration of a pharmaceutically effective amount of
the vaccine of claim 69.

74. A method for the detection of Streptococcus
pneumoniae or related bacteria in a biological sample
comprising:

(a) isolating the biological sample from a
patient;

(b) incubating the antibody or fragment of
claim 55 with the biological sample to form a mixture; and

(c) detecting specifically bound antibody
or fragment in the mixture which indicates the presence of
Streptococcus pneumoniae or related bacteria.

75. The method of claim 74, wherein the
antibody is selected from the group consisting of
F1-Pn3.1, F2-Pn3.2, F2-Pn3.3, and F2-Pn3.4.

76. The method of claim 74, wherein the
antibody is F1-Pn3.1.

77. A method for the detection of antibodies
specific to Streptococcus pneumoniae or related bacteria
in a biological sample comprising:

(a) isolating the biological sample from a
patient;

(b) incubating a polypeptide of claim 23
with the biological sample to form a mixture; and

(c) detecting specifically bound
polypeptide in the mixture, which indicates the presence
of antibodies specific to Streptococcus pneumoniae or
related bacteria.

78. A method for the detection of Streptococcus
pneumoniae or related bacteria in a biological sample
comprising:

(a) isolating the biological sample from a
patient;

(b) incubating a DNA probe having the DNA
sequence of claim 41 with the biological sample to form a
mixture; and

(c) detecting specifically bound DNA probe
in the mixture which indicates the presence of
Streptococcus pneumoniae and related bacteria.

79. The method of claim 78, wherein the DNA
probe is an oligomer having a sequence complementary to at
least about 6 contiguous nucleotides of a DNA sequence of
claim 41.

80. The method of claim 79, which further
comprises:

(a) providing a set of oligomers which are
primers for a polymerase chain reaction method and which
flank the target region; and

(b) amplifying the target region via the
polymerase chain reaction method.
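
For illustration only, the relationship between a complementary oligomer probe, a pair of flanking primers and the target region can be sketched in a few lines. The target used below is the unambiguous run at positions 1201-1226 of the consensus row of the aligned open reading frames; the probe, the primers and all function names are hypothetical and are not taken from the specification. Exact matching on a single strand is assumed, which is a simplification of real hybridization.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_sites(target, probe, min_len=6):
    """0-based positions on the target strand to which the probe is complementary."""
    if len(probe) < min_len:
        raise ValueError("probe is shorter than the minimum contiguous length")
    site = reverse_complement(probe)
    return [i for i in range(len(target) - len(site) + 1)
            if target[i:i + len(site)] == site]


def amplicon(target, forward_primer, reverse_primer):
    """Region of the target flanked by the two primers, or None if either is absent."""
    start = target.find(forward_primer)
    if start == -1:
        return None
    downstream_site = reverse_complement(reverse_primer)
    end = target.find(downstream_site, start + len(forward_primer))
    if end == -1:
        return None
    return target[start:end + len(downstream_site)]


# Illustrative target: consensus positions 1201-1226 (no ambiguity codes).
TARGET = "GACAACCAACCAGCCGTTGATATCCA"

print(probe_sites(TARGET, "GGCTGG"))            # hypothetical 6-mer probe
print(amplicon(TARGET, "GACAACCA", "TGGATATC"))  # hypothetical primer pair
```

The forward primer matches the target strand directly, whereas the reverse primer matches the opposite strand, which is why its reverse complement is searched for downstream of the forward primer.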






81. The use of a pharmaceutically effective
amount of the polypeptide of claim 23 for the prevention
of Streptococcus pneumoniae or related bacterial
infections in humans.

82. The use of a pharmaceutically effective
amount of an antibody specific to HSP72 for the prevention
of Streptococcus pneumoniae or related bacterial
infections in humans.

83. A method for producing a polypeptide or
fragment thereof comprising the steps of culturing the
unicellular host of claim 52 or 53 and isolating said
polypeptide or fragment.

84. A polypeptide in substantially pure form as
obtained by the method of claim 83.

85. An antibody or fragment thereof that
specifically binds to a polypeptide of claim 1 or 22.

86. A method for isolating the antibody of
claim 85 comprising:

(a) introducing a preparation of the
polypeptide of claim 1 or 22 into a mammal; and

(b) isolating serum from the mammal
containing said antibody.

87. A pharmaceutical composition comprising a
polypeptide of claim 1 or 22.

88. The pharmaceutical composition of claim 87,
which is a vaccine.

89. The pharmaceutical composition of claim 87,
further comprising one or more pharmaceutically acceptable
excipients.

90. A method for preventing infection of a
patient by Streptococcus pneumoniae, Streptococcus
pyogenes or Streptococcus agalactiae comprising the
administration of a pharmaceutically effective amount of
the vaccine of claim 88.

91. An antibody or fragment thereof that
specifically binds to a polypeptide of claim 1 or 22.

92. A method for the detection of Streptococcus
pneumoniae, Streptococcus pyogenes or Streptococcus
agalactiae in a biological sample comprising:

(a) isolating the biological sample from a
patient;

(b) incubating the antibody or fragment of
claim 91 with the biological sample to form a mixture; and

(c) detecting specifically bound antibody
or fragment in the mixture which indicates the presence of
Streptococcus pneumoniae, Streptococcus pyogenes or
Streptococcus agalactiae.

93. A method for the detection of antibodies
specific to Streptococcus pneumoniae, Streptococcus
pyogenes or Streptococcus agalactiae in a biological
sample comprising:

(a) isolating the biological sample from a
patient;

(b) incubating a polypeptide of claim 1 or
22 with the biological sample to form a mixture; and

(c) detecting specifically bound
polypeptide in the mixture, which indicates the presence
of antibodies specific to Streptococcus pneumoniae,
Streptococcus pyogenes or Streptococcus agalactiae.






94. A method for the detection of Streptococcus
pneumoniae, Streptococcus pyogenes or Streptococcus
agalactiae in a biological sample comprising:

(a) isolating the biological sample from a
patient;

(b) incubating a DNA probe having the DNA
sequence of claim 28 with the biological sample to form a
mixture; and

(c) detecting specifically bound DNA probe
in the mixture which indicates the presence of
Streptococcus pneumoniae, Streptococcus pyogenes or
Streptococcus agalactiae.



95. The method of claim 94, wherein the DNA
probe is an oligomer having a sequence complementary to at
least about 6 contiguous nucleotides of a DNA sequence of
claim 28.

96. The method of claim 95, which further
comprises:

(a) providing a set of oligomers which are
primers for a polymerase chain reaction method and which
flank the target region; and

(b) amplifying the target region via the
polymerase chain reaction method.



97. The use of a pharmaceutically effective
amount of the polypeptide of claim 1 or 22 for the
prevention of Streptococcus pneumoniae, Streptococcus
pyogenes or Streptococcus agalactiae infection in humans.

98. The use of a pharmaceutically effective
amount of an antibody specific to HSP72 for the prevention
of Streptococcus pneumoniae, Streptococcus pyogenes or
Streptococcus agalactiae infection in humans.

99. The use of a pharmaceutically effective
amount of a polypeptide according to any one of claims 2
to 21 for the prevention of Streptococcal infections in
humans.

100. A polypeptide having the amino acid
sequence of SEQ ID NO: 23, analogues, homologues, or
derivatives thereof.

101. A polypeptide having the amino acid
sequence of SEQ ID NO: 24, analogues, homologues or
derivatives thereof.
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spn-orf C T., 50 

sga-orf T A.. 50 

sgb-orf T A.. 50 

Consensus ATGTCTAAAA TTATYGGTAT TGACTTAGGT ACAACAAACT CAGCAGTWGC 50

spn-orf A A GC C..A A..C. 100

sga-orf G T CA T. .T C. .T. 100

sgb-orf G T CA T. .T C. .T. 100 

Consensus AGTTCTTGAA GGRACTGAAW SMAAAATCAT YGCWAACCCA GAAGGMAAYC 100 

spn-orf .C A..T C C..A C T... 150 

sga-orf .T T..A A T..T T G... 150 

sgb-orf .T T..A A T. .T T G. . . 150 

Consensus GYACAACTCC WTCWGTAGTM TCATTCAAAA AYGGWGAAAT YATCGTKGGT 150 

spn-orf T A. .T C T..A. ,T 200 

sga-orf C A..G C A, .A. .A 200 

sgb-orf T G. .A T T..T. ,T 200 

Consensus GATGCTGCAA AACGYCAAGC RGTDACAAAY CCAGAWACWG TWATCTCTAT 200 

spn-orf C T..G . . ,a 250 

sga-orf T T..A T 250 

sgb-orf C A..G X 250 

Consensus YAAATCWAAR ATGGGAACTT CTGAAAAAGT TTCTGCAAAT GGWAAAGAAT 250 

spn-orf .C A C T C T. G C..C 300 

sga-orf .T T T A T C. T T. .T 300 

sgb-orf .T T T A T C. T T..T 300 

Consensus AYACTCCWCA AGAAATYTCA GCWATGATYC TTCAATACYT KAAAGGYTAY 300 

spn-orf C T.. G ACC T C. .A 350 

sga-orf T A.. A GAA A T. .T 350 

sgb-orf T A.. A GAA A T. .T 350 

Consensus GCTGAAGACT AYCTTGGWGA RAAAGTARMM AAAGCWGTTA TYACWGTTCC 350 
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J sgb-orf ....A C C A C. .A 450 
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spn-orf T A ..A G. .A. . 500 

sga-orf A G . .T A. .T. . 500 

sgb-orf A G ..T A. .T.. 500 

Consensus CTTGCTTATG GTWTGGACAA GACTGACAAR GAWGAAAAAA TCTTRGTWTT 500 
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spn-orf .A.. A A T G T 1050 

sga-orf .T..A G T T G 1050 

sgb-orf .T..T G C T T 1050 

Consensus CWGTWAACCC TGATGAAGTR GTTGCYATGG GTGCKGCTAT CCAAGGTGGK 1050 

spn-orf ..G..T T C. . G..T C T G 1100 

sga-orf ..T.,C G G.. A. .C C C A 1100 

sgb-orf ..T..C G G. . A..C A C A 1100 

Consensus GTKATYACTG GKGATGTSAA RGAYGTTGTM CTTCTTGAYG TAACRCCATT 1100 

spn-orf C A.. A..T..A... C.T....:^. 1150 

sga-orf T T. . C. .C. .T. . . T.G C. 1150 

sgb-orf T T. . C..C..T-.. T.G C. 1150 

Consensus GTCACTTGGT ATYGAAACAA TGGGTGGWGT MTTYACWAAA YTKATCGAYC 1150 

spn-orf ....C..T.. A 1200 

sga-orf ....T..A.. T 1200 

sgb-orf ....C..A,. T 1200 

Consensus GCAAYACWAC WATCCCAACA TCTAAATCAC AAGTCTTCTC AACAGCAGCA 1200 

spn-orf C 1250 

sga-orf T 1250 

sgb-orf T 1250 

Consensus GACAACCAAC CAGCCGTTGA TATCCAYGTT CTTCAAGGTG AACGCCCAAT 1250 

spn-orf G,.T. .T..A 1300 

sga-orf G..T. .T.,T 1300 

sgb-orf A. .A. .C..T 1300 

Consensus GGCAGCAGAT AACAARACWC TYGGWCGCTT CCAATTGACT GATATCCCAG 1300 

spn-orf T..T C C C..G 1350 

sga-orf C..A T T T. .A 1350 

sgb-orf C. .A T T T. .A 1350 

Consensus CTGCACCTCG TGGAATYCCW CAAATYGAAG TAACATTTGA YATCGAYAAR 1350 

spn-orf ..C C. .G T. . G..C C ..T..A..T A 1400 

sga-orf ..C T. .T A.. A. .T C ..T..T..G G 1400 

sgb-orf ..T T. ,A T.. A. .T T ..C..T..T A 1400 

Consensus AAYGGTATYG TDTCTGTWAA RGCYAAAGAY CTYGGWACKC AAAARGAACA 1400
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AMMYATYGTY ATCMAATCDA AYKMMGGWYT DWCTGAHGAA GAAATYGAYM 14 50 



spn-orf GC T..A A..C..T. ..T.C...AA G. 1500

sga-orf GC C..T T..T..C. ..G.C...GC G. 1500

sgb-orf AA T..T A..T..T. ..G.A...GC A. 1500

Consensus RMATGATGAA AGAYGCWGAA GCWAAYGCYG AAKCMGATRM RAAACGTAAA 1500

Consensus GAAGAAGTWG AYCTTMRWAA YGAAGTKGAC CAAGCHATCT TTGCDACWGA 1550

Consensus AAARACWATY AARGAAACTG AAGGYAAAGG YTTYGAYRCA GAACGYGAYG 1600

Consensus CWGCSCAAKt Cj A b y T D AARK MWGCKCAAGA AKMHRRYAAC 1650

Consensus YTKGACGACA TGAAAGCWAA ACTTGAAGCW YTDAAYGAAA AAGCDCAAGS 1700

Consensus WYTKGCWGTT AAAMTBTACG ARCAAGCBGC WGCAGCDCAA CAAGCWSMWS 1750

Consensus AAGGDGCWGA AGGYGCACAA KCWRMTGATT CAKSAARYRM RGGYGATGAY 1800

Consensus GTYGTAGAYG GMGARTTYAC DGARAARTAA 1830

[Difference rows for positions 1501 to 1830 are not legible in this copy. Row-end positions: spn-orf 1794 and 1824; sga-orf 1797 and 1827; sgb-orf 1800 and 1830.]
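
The consensus rows of the nucleotide alignment use single-letter ambiguity codes (for example Y for C or T, W for A or T). As a minimal sketch, assuming the standard IUPAC nucleotide codes, a concrete sequence can be tested against a consensus row as follows. The candidate sequence is a hypothetical resolution of the first consensus row and is not one of the aligned sequences; the function names are likewise assumptions.

```python
# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def normalise(row):
    """Strip the block spacing used in the alignment figure and upper-case the row."""
    return row.replace(" ", "").upper()


def matches_consensus(seq, consensus):
    """True if every base of seq is allowed by the consensus code at that position."""
    seq, consensus = normalise(seq), normalise(consensus)
    return len(seq) == len(consensus) and all(
        base in IUPAC[code] for base, code in zip(seq, consensus)
    )


def count_variants(consensus):
    """Number of unambiguous sequences that the consensus row can stand for."""
    total = 1
    for code in normalise(consensus):
        total *= len(IUPAC[code])
    return total


# First consensus row of the alignment (positions 1-50).
ROW_1_50 = "ATGTCTAAAA TTATYGGTAT TGACTTAGGT ACAACAAACT CAGCAGTWGC"
# Hypothetical candidate with Y resolved to C and W resolved to A.
candidate = "ATGTCTAAAATTATCGGTATTGACTTAGGTACAACAAACTCAGCAGTAGC"

print(matches_consensus(candidate, ROW_1_50), count_variants(ROW_1_50))
```

A row with two two-fold ambiguity codes, such as the first one, stands for four possible unambiguous sequences; the per-sequence rows of the figure indicate which base each organism carries at those positions.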
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spn-prot 50 

sga-prot 50 

sgb-prot 50 

Consensus MSKIIGIDLG TTNSAVAVLE GTESKIIANP EGNRTTPSVV SFKNGEIIVG 50

spn-prot 100

sga-prot E 100

sgb-prot 100

Consensus DAAKRQAVTN PDTVISIKSK MGTSEKVSAN GKEYTPQEIS AMILQYLKGY 100

spn-prot T 150

sga-prot 150

sgb-prot 150

Consensus AEDYLGEKVE KAVITVPAYF NDAQRQATKD AGKIAGLEVE RIVNEPTAAA 150

spn-prot L E S 200

sga-prot 200

sgb-prot 200

Consensus LAYGMDKTDK DEKILVFDLG GGTFDVSILE LGDGVFDVLA TAGDNKLGGD 200

spn-prot H T....M S.. 250 

sga-prot 250 

sgb-prot E 250 

Consensus DFDQKIIDFL VAEFKKENGI DLSQDKHALQ RLKDAAEKAK KDLSGVTQTQ 250 

spn-prot E T. T V 300 

sga-prot 300 

sgb-prot 300 

Consensus ISLPFITAGS AGPLHLEMSL SRAKFDDLTR DLVERTKTPV RQALSDAGLS 300 

spn-prot 350

sga-prot 350

sgb-prot 350

Consensus LSEIDEVILV GGSTRIPAVV EAVKAETGKE PNKSVNPDEV VAMGAAIQGG 350

spn-prot 400

sga-prot 400

sgb-prot 400

Consensus VITGDVKDVV LLDVTPLSLG IETMGGVFTK LIDRNTTIPT SKSQVFSTAA 400

spn-prot 450

sga-prot 450

sgb-prot 450

Consensus DNQPAVDIHV LQGERPMAAD NKTLGRFQLT DIPAAPRGIP QIEVTFDIDK 450

spn-prot T S.K... 500

sga-prot K..D. .SE 500

sgb-prot K 500

Consensus NGIVSVKAKD LGTQKEQHIV IQSNSGLTDE EIDRMMKDAE ANAEADAKRK 500

spn-prot R A A... D DN. 550

sga-prot A 550

sgb-prot 550

Consensus EEVDLKNEVD QAIFATEKTI KETEGKGFDT ERDAAQSALD ELKKAQESGN 550

spn-prot G QE TGN.G-... 598

sga-prot M N. . .NN. . . 599

sgb-prot SA. .SSKG.. 600

Consensus LDDMKAKLEA LNEKAQALAV KLYEQAAAAQ QAAQGAEGAQ A.DSA. .-DD 600

spn-prot 607

sga-prot 608

sgb-prot 609

Consensus VVDGEFTEK 609



WO 96/40928                                   PCT/CA96/00322

22/22

[Figure, sheet 22/22: gel image, lanes 1 to 4; marker labels 97.4, 68, 43 and 25.]





INTERNATIONAL SEARCH REPORT

International Application No
PCT/CA 96/00322

A. CLASSIFICATION OF SUBJECT MATTER

IPC 6  C12N15/31  C07K14/315  C12N1/21  C07K16/12  A61K39/09
       A61K39/395  G01N33/569  C12Q1/68

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)

IPC 6  C12N  C07K  A61K

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category *  Citation of document, with indication, where appropriate, of the relevant passages; Relevant to claim No.

GENE, vol. 142, no. 1, 3 May 1994, AMSTERDAM NL, pages 91-96, XP002013303
M.J.S. BARRIL ET AL.: "Cloning and sequencing of the Lactococcus lactis subsp. lactis dnaK gene using a PCR-based approach"
see figure 3
Relevant to claim No.: 1, 2, 6-21, 23, 28, 41

WO,A,93 17712 (BIOCINE SCLAVO SPA) 16 September 1993
see page 50
Relevant to claim No.: 1-101

WO,A,92 14488 (UAB RESEARCH FOUNDATION) 3 September 1992
see the whole document
Relevant to claim No.: 1-101


[ ] Further documents are listed in the continuation of box C.

[X] Patent family members are listed in annex.

* Special categories of cited documents:

'A' document defining the general state of the art which is not considered to be of particular relevance

'E' earlier document but published on or after the international filing date

'L' document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

'O' document referring to an oral disclosure, use, exhibition or other means

'P' document published prior to the international filing date but later than the priority date claimed

'T' later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

'X' document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

'Y' document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

'&' document member of the same patent family



Date of the actual completion of the international search: 13 September 1996

Date of mailing of the international search report: 27.09.96

Name and mailing address of the ISA

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Authorized officer

Cupido, M



INTERNATIONAL SEARCH REPORT

International application No.
PCT/CA96/00322

Box I  Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)

This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [X] Claims Nos.: 67, 73, 81, 82, 90, 97-99
because they relate to subject matter not required to be searched by this Authority, namely:

Remark: Although claims 67, 73, 81, 82, 90 and 97-99 are directed to a method
of treatment of the human/animal body, the search has been carried out
and based on the alleged effects of the composition.

2. Claims Nos.:
because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:

3. Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II  Observations where unity of invention is lacking (Continuation of item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:

1. As all required additional search fees were timely paid by the applicant, this international search report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report
covers only those claims for which fees were paid, specifically claims Nos.:

4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Remark on Protest

[ ] The additional search fees were accompanied by the applicant's protest.

[ ] No protest accompanied the payment of additional search fees.

Form PCT/ISA/210 (continuation of first sheet (1)) (July 1992)



INTERNATIONAL SEARCH REPORT

Information on patent family members

International Application No
PCT/CA 96/00322

Patent document cited    Publication    Patent family     Publication
in search report         date           member(s)         date

WO-A-9317712             16-09-93       CA-A- 2131551     16-09-93
                                        EP-A- 0632727     11-01-95
                                        JP-T- 7504423     18-05-95

WO-A-9214488             03-09-92       AU-B- 667668      04-04-96
                                        AU-A- 1445392     15-09-92
                                        AU-A- 4798096     27-06-96
                                        EP-A- 0571538     01-12-93
                                        JP-T- 6504446     26-05-94
                                        US-A- 5476929     19-12-95

Form PCT/ISA/210 (patent family annex) (July 1992)



